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DECLARATION UNDER 37 CFR §1.132 BY TOSHINARI SUGASAWA 

I, Toshinari Sugasawa, declare as follows: 

1 . I am a co-inventor of the subject matter described and claimed in the above-referenced 
patent application, U.S. Patent Application No. 09/3 19,724. A bibliography of relevant 
publications and my curriculum vitae are attached hereto. 



2. I have read and understand the non-final Office AcJiem (paper no. 16) that was mailed 
March 24, 2003, in the above-referenced application, including the rejection for lack of 
enablement under 35 U.S. Call 2, first paragraph. It is my understanding that the basis for the 
rejection is that the specification, while being enabled for polynucleotides encoding a protein of 
SEQ ID No. 14 and the portion thereof capable of binding ICYP, allegedly fails to provide 
sufficient guidance for polypeptides that are not 100% identical to SEQ ID No. 14, i.e. 9 
substitutions, deletions and insertions, but which still retain a desired property of the polypeptide 
of SEQ ID No. 14. 

3. The specification reports the identification and cloning of the human ICYP receptor 
following the identification of a human expressed sequence tag (EST) using the tBLASTn 
program. The human fragment was identified based on homology with a highly conserved 
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formic acid cleaved fragment of the rat protein (SEQ ID No. 6), which demonstrated almost 
100% homology to the corresponding human sequence identified (SEQ ID No. 5) (see Example 
3 starting on page 32). 

This suggests that this family of proteins is highly conserved across species, and that the same, 
well known methodology used to clone the human gene could also be used without undue 
experimentation to clone natural variants from other species. 

4. In line with the methodology reported in Example 3 of the specification, this declaration 
describes the results of tBLASTn database searches using the fragments represented by SEQ ID 
Nos. 5 and 6. As shown in the attached tBLASTn results, when conducted with the amino acid 
sequence of SEQ ID No. 5, the search revealed 14 clones showing 100% homology in the region 
of the fragment represented by SEQ ID No. 5. Most all of these clones reveal homologous 
animal ICYP receptors, including the receptor from human, mouse, C. elegans, zebra fish, etc. 
(Note that TM9SF3 (transmembrane protein 9 superfamily member 3) and SMBP (SM-1 1044 
binding protein) and ICYP receptor are the same protein.) 

5. When the tBLASTn search was conducted with the amino acid sequence of SEQ ID No. 

6. which comprises an "X" in the amino acid sequence of the formic acid cleaved fragment, 16 
clones were identified including chicken and fruit fly TM9SF3. Thus, the fragment used in 
Example 3 of the specification to identify human ICYP receptor cDNA using the tBLASTn 
program could also be used in the same manner to identify natural variants from other species. 
Furthermore, clones corresponding to the identified sequences could be readily isolated by one of 
skill in the art using the methodology outlined in Example 3 . 
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6. 



I further declare that all statements made herein of my own knowledge are true, and that 



all statements made on information and belief are believed to be true, and further, that these 
statements were made with the knowledge that willful false statements and the like so made are 
punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United States 
Code, and that such willful false statements may jeopardize the validity of the application or any 
patent issuing thereon. 
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TBLASTN 2.2.6 [Apr-09-2003] 
Reference : 

Altschul, Stephen F. . Thomas L. Madden, Alejandro A. Schsffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Liptnan (1997), 
"Gapped BLAST and PS I -BLAST: a new generation of protein database search 
programs". Nucleic Acids Res. 25:3389-3402. 

RID: 1054277721-021460-25890 
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(17 letters) 



Database: All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, 
GSS, or phase 0, 1 or 2 HTGS sequences) 

1.772,460 sequences: 8.313,506,101 total letters 



If you have any problems or questions with the results of this search 
please refer to the BLAST FAQs 



Taxonomy reports 



Distribution of 23 Blast Hits on the Query Sequence 



Mouse-over to show defline and scores. Click to show alignments 



Color Key for Rlignnent Scores 



40-50 
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Sequences producing significant alignments: 



Score E 
(bits) Value 



gi 1 9755050 1 gb I AF2691 50. 1 1 AF2691 50 Homo sapiens transmembran. 
gi l9755048|gblAF269149. 1 1AF269149 Gal lus gal lus transmembra. 

gi 130026031 1 ref I NM 178146.11 Mus musculus RIKEN cDNA 181007. 

gi 126344651 Idbj IAK075807. 1 1 Mus musculus 10 day old male pa. 

gi 1 22760523 1 db j I AK074824. 1 1 Homo sapiens cDNA FLJ90343 f is.. 

g i 1 6650721 1 gb I AF1 1 6347. 1 1 AF1 1 6347 Homo sapiens SM-11044 bin. 

gi 11 7543267 1 ref I NM 067729.11 Caenorhabdi tis elegans transme. 

gi 1 1 4042725 Idbi I AK027788.1 1 Homo sapiens cDNA FLJ14882 f is. . 

gi 119111161 IreflNM 133352. 1 1 Mus musculus RIKEN cDNA 181007. 

gi 1 1 343591 4 1 gb I BC004799. 1 1 Mus musculus RIKEN cDNA 1810073M. 
g i 1 28278504 1 gb I BC046021 . 1 1 Danio rerio, Similar to expresse. 

gi 1 9755052 1 gb I AF2691 51 . 1 1AF269151 Mus musculus transmembran. 

gi 17021041 Idbi IAK000756. 1 1 Homo sapiens cDNA FLJ20749 fis. . 
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gi H8088072lgblBC020959. 1 1BC020959 HolRTsapi ens, clone MGC:. 

gi !24658498lreflNM 139722.21 Drosoph i I a me I anogaster CG1 059. . 

gi l17945956lgblAY071401.1l Drosophila me I anogaster RE48767 . . 
gi I30681999lref |NM_100970. 21 Arabidopsis tha liana endomembr.. 

gi 1 1 7 933309 1 gb 1 AF446365. 1 1AF446365 Arabidopsis thai i ana Atl. . 

gi 1 201 4841 4 1 gb I AY081 536. 1l Arabidopsis tha liana endomembr an. . 



gi 1 1 581 0406 1 gb I AY056242. 1 1 
gi l30793946lgblBT008600. 1 1 

gih5451171lgblAY054666.il 

gi l17380901lgblAY063907.il 



Arabidopsis tha liana putative en. 
Arabidopsis tha liana clone U1129. 

Arabidopsis tha Nana endomembr an. 

Arabidopsis tha liana putative en. 
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□ > gj 1 9755050 1 gb I AF2691 50. 1 1 AF2691 50 fill Homo sapiens transmembrane protein TM9SF3 (TM9SF3) mRNA. complete 
cds 

Length = 3592 

Score = 43.1 bits (100), Expect = 0.008 

Identities = 17/17 (100%), Positives = 17/17 (100%) 

Frame = +2 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 854 DPSFFQHR I HWFS I FNS 904 



□ > gi 1 9755048 1 gb I AF2691 49. 1 1 AF2691 49 Gallus gallus transmembrane protein TM9SF3 (TM9SF3) mRNA, partial 
cds 

Length = 234 

Score = 43.1 bits (100), Expect = 0.008 

Identities = 17/17 (100%). Positives = 17/17 (100%) 

Frame = +1 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 112 DPSFFQHR I HWFS I FNS 162 



Q> gi 130026031 Iref [NM 178146.11 ill Mus musculus RIKEN cDNA 1810073M23 gene (1810073M23Rik) , mRNA 
Length =1155 

Score = 43.1 bits (100), Expect = 0.008 

Identities = 17/17 (100%), Positives = 17/17 (100%) 

Frame = +1 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 631 DPSFFQHR I HWFS I FNS 681 



□ > gi 126344651 jdbi IAK075807. 1 1 S Mus musculus 10 day old male pancreas cDNA, RIKEN full-length 
enriched library, clone:l810073M23 product: unknown EST, 
fu 1 1 i nsert sequence 
Length = 2479 

Score = 43.1 bits (100), Expect = 0.008 

Identities = 17/17 (100%), Positives = 17/17 (100%) 

Frame = +2 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 749 DPSFFQHR I HWFS I FNS 799 



G > gi i 22760523 1 db i I AK074824. 11 HIS Homo sapiens cDNA FLJ90343 fis, clone NT2RP2002824, weakly similar 
to END0S0MAL P24A PROTEIN PRECURSOR 
Length = 2864 

http://vmw.ncbi.nlm.nih.gov/blast/Blast.cgi 
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Score = 43.1 bits (100). Expect = 0.008 
Identities = 17/17 (100%), Positives = 17/17 (100%) 
Frame = +1 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 145 DPSFFQHR I HWFS I FNS 195 



O > g i 1 6650721 IgblAFU 6347. 1 1 AF1 1 6347 fill! Homo sapiens SM-11044 binding protein mRNA. partial cds 
Length =1804 

Score = 43.1 bits (100). Expect = 0.008 

Identities = 17/17 (100%), Positives = 17/17 (100%) 

Frame = +1 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 604 DPSFFQHR I HWFS I FNS 654 



O > gi 1 17543267 1 ref lNM_067729. 1 1 H Caenorhabditi s elegans transmembrane protein TM9SF3 (66.6 kD), mRNA 
Length =1743 

Score = 43.1 bits (100), Expect = 0.008 

Identities = 17/17 (100%), Positives = 17/17 (100%) 

Frame - +1 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 610 DPSFFQHR I HWFS I FNS 660 



□ > gi 1 1 4042725 1 db M AK027788. 1 1 BEl Homo sapiens cDNA FLJ14882 fis, clone PLACE1003537. weakly similar 
to ENDOSOMAL P24A PROTEIN PRECURSOR 
Length = 2459 

Score = 43.1 bits (100), Expect = 0.008 

Identities = 17/17 (100%), Positives = 17/17 (100%) 

Frame = +3 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 510 DPSFFQHR I HWFS I FNS 560 



□ > gi 119111161 IreflNM 133352. 1 1 BEl Mus musculus R1KEN cDNA 1810073M23 gene (1 810073M23R i k) , mRNA 
Length = 3143 

Score = 43.1 bits (100). Expect = 0.008 

Identities = 17/17 (100%), Positives = 17/17 (100%) 

Frame = +2 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 713 DPSFFQHR I HWFS I FNS 763 



□ > gi 1 1 343591 4 1 gb I BC0O4799. 1 1 SHU Mus musculus RIKEN cDNA 1810073M23 gene. mRNA (cDNA clone MGC:8136 
IMAGE: 3589474). complete cds 
Length = 2848 

Score = 43.1 bits (100), Expect = 0.008 

Identities = 17/17 (100%). Positives = 17/17 (100%) 

Frame = +2 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 410 DPSFFQHR I HWFS I FNS 460 



O > gi i 28278504 jgb I BC046021. 1 1 Danio rerio, Similar to expressed sequence AW146116, clone 
MGC: 56246 I MAGE: 5601 580. mRNA. complete cds 
Length = 2082 
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Score = 43.1 bits (100), Expect = 0.' 

Identities = 17/17 (100%), Positives = 17/17 (100%) 
Frame = +2 

Query: 1 DPSFFOHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 725 DPSFFQHR I HWFS I FNS 775 



HI > fii 1 9755052 1 gb I AF2691 51 . 1 1 AF2691 51 !HB Mus musculus transmembrane protein TM9SF3 (Tm9sf3) mRNA. complete 
cds 

Length = 3143 

Score = 43.1 bits (100). Expect = 0.008 

Identities = 17/17 (100%), Positives = 17/17 (100%) 

Frame - +2 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 713 DPSFFQHR I HWFS I FNS 763 



□ > gi 17021041 Idb i IAK000756. 1 1 Ml Homo sapiens cDNA FLJ20749 f is, clone HEP05301 
Length = 2973 

Score = 43.1 bits (100), Expect = 0.008 

Identities = 17/17 (100%). Positives = 17/17 (100%) 

Frame - +2 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 257 DPSFFQHR I HWFS I FNS 307 



□ > gi l18088072|gblBC020959. 1 IBC020959 Iffllll Homo sapiens, clone MGC: 8842 IMAGE: 385001 5, mRNA, complete cds 
Length = 2870 

Score = 43.1 bits (100), Expect = 0.008 

Identities = 17/17 (100%). Positives = 17/17 (100%) 

Frame = +2 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DPSFFQHR I HWFS I FNS 
Sbjct: 434 DPSFFQHR I HWFS I FNS 484 



O > gi 1 24658498 1 r ef I NH 139722.21 H Drosophila melanogaster CG10590-PA [Drosophila melanogaster] 
(CG10590) mRNA, complete cds 
Length = 2663 

Score = 42.0 bits (97), Expect =0.017 

Identities = 16/17 (94%), Positives = 17/17 (100%) 

Frame = +1 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DP+FFQHR I HWFS I FNS 
Sbjct: 823 DPNFFQHR I HWFS I FNS 873 



□ > gi 11 7945956 leb I AY071401. 1 1 BUS Drosoph i I a melanogaster RE48767 full length cDNA 
Length = 2662 

Score = 42.0 bits (97), Expect = 0.017 

Identities = 16/17 (94%), Positives = 17/17 (100%) 

Frame = +1 

Query: 1 DPSFFQHR I HWFS I FNS 17 

DP+FFQHR I HWFS I FNS 
Sbjct: 823 DPNFFQHR I HWFS I FNS 873 



O > gi j 30681 999 1 r ef I NM 100970.21 Arabidopsis thaliana endomembrane protein 70. putative (At1g10950) 
mRNA. complete cds 
Length = 2238 

Score = 34.7 bits (78). Expect = 2.7 
http://www.ncbi.nlm.nih.gov/blast/Blast.cgi 
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Identities = 13/17 (76%), Positives (88%) 
Frame = +2 

Query: 1 DPSFFQHRIHWFSIFNS 17 

D FF+H+IHWFSIFNS 
Sbjct: 740 DYPFFEHQ I HWFS I FNS 790 



□ > ei 1 1 7933309 1 gb I AF446365. 1 1AF446365 H Arabidopsis tha liana At1g10950/T19D16J3 mRNA, complete cds 
Length = 2094 

Score = 34.7 bits (78). Expect = 2.7 

Identities = 13/17 (76%), Positives = 15/17 (88%) 

Frame = +2 

Query: 1 DPSFFQHRIHWFSIFNS 17 

D FF+H+IHWFSIFNS 
Sbjct: 740 DYPFFEHQ I HWFS I FNS 790 



O > gi 120148414 |gb IAY081536. 1 1 IS Arabidopsis thai i ana endomembrane protein EMP70 precusor i so log 
(At1g10950) mRNA, complete cds 
Length = 1888 

Score = 34.7 bits (78). Expect = 2.7 

Identities = 13/17 (76%). Positives = 15/17 (88%) 

Frame - +1 

Query: 1 DPSFFQHRIHWFSIFNS 17 

D FF+H+IHWFSIFNS 
Sbjct: 634 DYPFFEHQ I HWFS I FNS 684 



D > gi 11581 0406 1 gb I AY056242. 1 1 El Arabidopsis thaliana putative endomembrane protein EMP70 precusor 
isolog (At1g10950) mRNA. partial cds 
Length = 1650 

Score = 34.7 bits (78), Expect = 2.7 

Identities = 13/17 (76%), Positives = 15/17 (88%) 

Frame = +2 

Query: 1 DPSFFQHRIHWFSIFNS 17 

D FF+H+IHWFSIFNS 
Sbjct: 251 DYPFFEHQ I HWFS I FNS 301 



O > gi 1 30793946 1 gb I BT008600. 1 1 Arabidopsis thaliana clone U11293 putative endomembrane protein 70 
(At1g10950) mRNA, complete cds 
Length =1966 

Score = 34.7 bits (78), Expect = 2.7 

identities = 13/17 (76%), Positives = 15/17 (88%) 

Frame = +1 

Query: 1 DPSFFQHRIHWFSIFNS 17 

D FF+H+IHWFSIFNS 
Sbjct: 634 DYPFFEHQ I HWFS I FNS 684 



fj > ei 115451171 |gblAY054666. 1 I IS Arabidopsis thaliana endomembrane protein EMP70 precusor isolog 
(At1g10950; T19D16. 13) mRNA. complete cds 
Length = 2062 

Score = 34.7 bits (78). Expect = 2.7 

Identities = 13/17 (76%), Positives = 15/17 (88%) 

Frame = +3 

Query: 1 DPSFFQHRIHWFSIFNS 17 

D FF+H+IHWFSIFNS 
Sbjct: 693 DYPFFEHQ I HWFS I FNS 743 



D > gi 117380901 |gblAY063907. 1 1 H Arabidopsis thaliana putative endomembrane protein EMP70 precusor 
isolog (At1g 10950) mRNA. complete cds 
Length =2113 
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Score = 34.7 bits (78). Expect = 2. 7 1 



Identities = 13/17 (76%). Positives = 15/17 (88%) 
Frame = +3 

Query: 1 DPSFFQHR I HWFS I FNS 17 

D FF+H+IHWFSIFNS 
Sbjct: 717 DYPFFEHQ I HWFS I FNS 767 



Database: All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, GSS, 
or phase 0, 1 or 2 HT6S sequences) 

Posted date: May 30. 2003 12:03 AM 
Number of letters in database: -276,428,487 
Number of sequences in database: 1,772,460 

Lambda K H 

0.337 0.144 0.553 

Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BL0SUM62 

Gap Penalties: Existence: 11, Extension: 1 

Number of Hits to DB: 258.839,836 

Number of Sequences: 1772460 

Number of extensions: 955261 

Number of successful extensions: 13875 

Number of sequences better than 10.0: 23 

Number of HSP' s better than 10.0 without gapping: 13550 

Number of HSP' s successfully gapped in prelim test: 0 

Number of HSP' s that attempted gapping in prelim test: 0 

Number of HSP' s gapped (non-prelim): 13873 

length of query: 17 

length of database: 2.771,168,700 

effective HSP length: 0 

effective length of query: 24 

effective length of database: 2.771.168.700 

effective search space: 66508048800 

effective search space used: 66508048600 

frameshi ft window, decay const: 50, 0.1 

T: 13 

A: 40 

X1: 15 (7.3 bits) 
X2: 38 (14.6 bits) 
X3: 64 (24.7 bits) 
Si: 39 (21.7 bits) 
S2: 73 (32.7 bits) 
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D 1: AF269150. Homo sapiens tran...[gi:9755050] 



LOCUS AF269150 3592 bp mRNA linear PRI 09-AUG-2000 

DEFINITION Homo sapiens transmembrane protein TM9SF3 (TM9SF3) mRNA, complete 
cds. 

ACCESSION AF269T50 

VERSION AF269150. 1 Gl:9755050 

KEYWORDS 

SOURCE Homo sapiens (human) 
ORGAN I SM Homo sapiens 

Eukaryota; Metazoa: Chordata; Craniata; Vertebrata; Eute leostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 3592) 
AUTHORS Warner, S.J. and Lomax, M. I. 

TITLE Evolution of the TM9 Super Fami ly of Membrane Spanning Proteins 
JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 3592) 
AUTHORS Warner, S.J. and Lomax, M. I . 
TITLE Direct Submission 

JOURNAL Submitted (10-MAY-2000) Kresge Hearing Research Institute, 

University of Michigan, 9301E MSRB Ml, 1150 West Medical Center 
Drive. Ann Arbor, Ml 48109-0648, USA 
FEATURES Locat i on/Qua I if iers 

source 1. .3592 

/organismF"Homo sapiens* 
/mo I _type="mRNA" 
/db_xref="taxon:9606" 
/chromosome="10" 
/map="10q23-q24" 
gene 1 . . 3592 

/gene="TM9SF3" 
CDS 218.. 1987 

/gene=~TM9SF3" 

/nets— "'t r arisHiSirijr 2ns protein 9 SLipsrfsirii !y member 3 
/codon_start=1 

/product="transmembrane protein TM9SF3" 
/protein id=" AAF98159. V 
/db_xref="G I: 9755051" 

/trans I at i on="MRPLPGALGVAAAAALWLLLLLLPRTRADEHEHTYQDKEEVVLW 
MNTVGPYHNRQETYKYFSLPFCVGSKKS I SHYHETLGEALQGVELEFSGLD I KFKDDV 
MPATYCE I DLDKEKRDAFVYA I KNHYWYQMY I DDLP I WG I VGEADENGEDYYLWTYKK 
LE I GFNGNR I VDVNLTSEGKVKLVPNTK I Q I SYSVKWKKSDVKFEDRFDKYLDPSFFO 
HR I HWFS I FNSFMMV I FLVGLVSM I LMRTLRKDYARYSKEEEMDDMDRDLGDEYGWKQ 
VHGDVFRPSSHPL I FSSL I GSGCQ I FAVSL I V 1 1 VAM I EDLYTERGSMLSTA I FVYAA 
TSPVNGYFGGSLY ARQGGRRW I KQMF I GAFL I PAMVCGTAFF I NF I A I YYHASRA I PF 
GTMVAVCC I CFFV I LPLNLVGT I LGRNLSGQPNFPCRVNAVPRP I PEKKWFMEPAV I V 
CLGG I LPFGS I F I EMYF I FTSFWAYK I YYVYGFMMLVLV I LC I VTVCVT I VCTYFLLN 
AEDYRWQWTSFLSAASTA I YVYMYSFYYYFFKTKMYGLFQTSFYFGYMAVFSTALG I M 
CGA I GYMGTSAFVRK I YTNVK I D" 

BASE COUNT 1026 a 604 c 815 g 1147 t . 

ORIGIN 

1 gaggaagagg ctgaggaggc gcggggggcg ggggaggctc aggagcgggc ggtgacggcg 
61 acggcggcgg cagaggaggc agcggctggg ccgggccccg tgcgtctgtc cgcgccccgt 
121 ggatgcgaat cggccgcggc ggaggcggcg gcggcggagg aggcggcggc gggaggagga 
181 gtcggtgagc cggctccggg ccggaggggc gcggaggatg aggccgctgc ctggcgctct 
241 tggcgtggcg gcggccgccg cgctgtggct gctgctgctg ctgctgcccc ggacccgggc 
301 ggacgagcac gaacacacgt atcaagataa agaggaagtt gtcttatgga tgaatactgt 
361 tgggccctac cataatcgtc aagaaacata taagtacttt tcacttccat tctgtgtggg 
421 gtcaaaaaaa agtatcagtc attaccatga aactctggga gaagcacttc aaggggttga 
481 attggaattt agtggtctgg atattaaatt taaagatgat gtgatgccag ccacttactg 
541 tgaaattgat ttagataaag aaaagagaga tgcatttgta tatgccataa aaaatcatta 
601 ctggtaccag atgtacatag atgatttacc aatatggggt attgttggtg aggctgatga 
661 aaatggagaa gattactatc tttggaccta taaaaaactt gaaataggtt ttaatggaaa 
721 tcgaattgtt gatgttaatc taactagtga aggaaaggtg aaactggttc caaatactaa 
781 aatccagatt tcatattcag taaaatggaa aaagtcagat gtgaaatttg aagatcgatt 
841 tgacaaatat cttgatccgt ccttttttca acatcggatt cattggtttt caattttcaa 
901 ctccttcatg atggtgatct tcttggtggg cttagtttca atgattttaa tgagaacatt 
961 aagaaaagat tatgctcggt acagtaaaga ggaagaaatg gatgatatgg atagagacct 
1021 aggagatgaa tatggatgga aacaggtgca tggagatgta tttagaccat caagtcaccc 
1081 actgatattt tcctctctga ttggttctgg atgtcagata tttgctgtgt ctctcatcgt 
1141 tattattgtt gcaatgatag aagatttata tactgagagg ggatcaatgc tcagtacagc 
1201 catatttgtc tatgctgcta cgtctccagt gaatggttat tttggaggaa gtctgtatgc 
1261 tagacaagga ggaaggagat ggataaagca gatgtttatt ggggcattcc ttatcccagc 
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tatggtgtgt ggcactgcct 



tcttc! 



tttcatagcc 



atttattacc 



atgcttcaag 



1381 agccattcct tttggaacaa tggtggccgt ttgttgcatc tgtttttttg ttattcttcc 

1441 tctaaatctt gttggtacaa tacttggccg aaatctgtca ggtcagccca actttccttg 

1501 tcgtgtcaat gctgtgcctc gtcctatacc ggagaaaaaa tggttcatgg agcctgcggt 

1561 tattgtttgc ctgggtggaa ttttaccttt tggttcaatc tttattgaaa tgtatttcat 

1621 cttcacgtct ttctgggcat ataagatcta ttatgtctat ggcttcatga tgctggtgct 

1681 ggttatcctg tgcattgtga ctgtctgtgt gactattgtg tgcacatatt ttctactaaa 

1741 tgcagaagat taccggtggc aatggacaag ttttctctct gctgcatcaa ctgcaatcta 

1801 tgtttacatg tattcctttt actactattt tttcaaaaca aagatgtatg gcttatttca 

1861 aacatcattt tactttggat atatggcggt atttagcaca gccttgggga taatgtgtgg 

1921 agcgattggt tacatgggaa caagtgcctt tgtccgaaaa atctatacta atgtgaaaat 

1981 tgactagaga cccaagaaaa cctggaactt tggatcaatt tctttttcat aggggtggaa 

2041 cttgcacagc aaaaacaaac aaacgcaaga agagatttgg gctttaacac actgggtact 

2101 ttgtgggtct ctctttcgtc ggtggcttaa agtaacatct atttccattg atcctaggtt 

2161 cttcctgact gctttctcca actgttcaca gcaaatgctt ggattttatg cagtaggcat 

2221 tactacagta catggctaat cttcccaaaa actagctcat taaagatgaa atagaccagc 

2281 tctcttcagt gaagaggaca aatagtttat ttaaagcatt tgttccaata aaataaatag 

2341 agggaaactt ggatgctaaa attacatgaa taggaatctt cctggcactt agtgtttcta 

2401 tgttattgaa aaatgatgtt ccagaaagat tacttttttc ctcttatttt tactgccatt 

2461 gtcgacctat tgtgggacat ttttatatat tgaatctggg ttcttttttg actttttttt 

2521 ttcccaatcc aacagcatcc ttttttttaa aagagagaat tagaaaatat taaatcctgc 

2581 atgtaatata tctgctgtca tcttagttgg accaacttcc catttattta tcttaaaact 

2641 atacagttac atcttaattc catccaaaga agatacagtt tgaagacaga agtgtactct 

2701 ctacaatgca atttactgta cagttagaaa gcaaagtgtt aaatggagaa gatacttgtt 

2761 tttattaaac attttgagat ttagataaac tacattttaa ctgaatgtct aaagtgatta 

2821 tcttttttcc ccccaagtta gtcttaaatc ttttgggttt gaatgaaggt tttacataag 

2881 aaattattaa aaacaagggg ggtgggtaat aaatgtatat aacattaaat aatgtaacgt 

2941 aggtgtagat tcccaaatgc atttggatgt acagatcgac tacagagtac ttttttctta 

3001 tgatgattgg tgtagaaatg tgtgatttgg gtgggctttt acatcttgcc taccattgca 

3061 tgaaacattg gggtttcttc aaaatgtgtg tgtcatactt cttttgggag gggggttgtt 

3121 ttcttctgtt tattttctga gactcctaca ggagccaaat ttgtaattta gagacactta 

3181 attttgttaa tcctgtctgg gacacttaag taacatctaa agcattattg ctttagaatg 

3241 ttcaaataaa atttcctgac caaattgttt tgtggaaata gatgtgtttg caatttgaag 

3301 atatctttct gtccagaagg caaaattacc gaatgccatt tttaaaagta tgctataaac 

3361 tatgctactc tcatacaggg gacccgtatt ttaaaatctc cagacttgct tacatctaga 

3421 ttatccagca caatcataaa gtgaatgaca aaccctttga atgaaattgt ggcacaaaat 

3481 ctgttcaggt tggtgtaccg tgtaaagtgg ggatggggta aaagtggtta acgtactgtt 

3541 ggatcaacaa ataaaggtta cagttttgta agagaaaaaa aaaaaaaaaa aa 
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O 1: AF269149. Gallus gallus tra...[gi:9755048] 



Unks 



LOCUS AF269149 234 bp mRNA linear VRT 09-AUG-2000 

DEFINITION Gallus gallus transmembrane protein TM9SF3 (TM9SF3) mRNA, partial 
cds. 

ACCESSION AF269149 

VERSION AF269149.1 Gl :9755048 

KEYWORDS 

SOURCE Gallus gallus (chicken) 
ORGANISM Gallus gallus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi : 
Archosauria; Aves; Neognathae; Gal I i formes; Phasianidae; 
Phasianinae; Gal lus. 
REFERENCE 1 (bases 1 to 234) 
AUTHORS Gong.T.W.. He g etna n, A. D. , Sh i n r J. J. , Adler. H. J. , Raphael, Y. and 
Lomax, M. I. 

TITLE Identification of genes expressed after noise exposure in the chick 

basi lar papi I la 
JOURNAL Hear. Res. 96 (1-2). 20-32 (1996) 
MEDLINE 96414275 
PUBMED 8817303 
REFERENCE 2 (bases 1 to 234) 
AUTHORS Warner, S.J. and Lomax, M. I. 

TITLE Evolution of the TM9 Super Family of Membrane Spanning Proteins 
JOURNAL Unpublished 
REFERENCE 3 (bases 1 to 234) 
AUTHORS Warner, S.J. and Lomax, M. I . 
TITLE Direct Submission 

JOURNAL Submitted (10-MAY-2000) Kresge Hearing Research Institute, 

University of Michigan, 9301E MSR6 III. 1150 West Medical Center 
Drive, Ann Arbor. Ml 48109-0648. USA 
FEATURES Location/Qual i f iers 

source 1..234 

/organ i sm=*'Ga I lus ga 1 1 us " 
/mo l_type=*mRNA" 
/db_xref ="taxon : 9031 " 
/tissue_type="basi lar papi I la" 
gene <1..>234 

/gene="TM9SF3" 
CDS <1..>234 

/gene^TMgSFS" 

/note=*transmembrane protein 9 superfamily member 3" 
/codon_start=1 

/product="transmembrane protein TM9SF3* 
/pr ot e i n_ i d=" AAF98158. ) " 
/db_xref="G 1:9755049" 

/trans I at i on="ASEGKVKLVPNTKI QMSYSMKWKKSDVKFEDRFDKYLDPSFFQH 

R I HWFS I FNSFMMV I FLVGLVSM I LMRTLRKKKK* 
BASE COUNT 80 a 38 c 46 g 70 t 

ORIGIN 

1 gccagcgaag gaaaggtgaa actggtacca aatacaaaaa tccagatgtc atattctatg 
61 aagtggaaga agtcagatgt gaaatttgaa gaccgatttg acaagtacct tgacccatct 
121 ttctttcagc acagaattca ctggttttca attttcaatt ctttcatgat ggtgatcttt 
181 ctggttggcc tagtgtctat gatattaatg agaactttgc gaaaaaaaaa aaaa 

// 
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□ 1: NM.1 78146. Mus musculus RIKE...[gi:30026031] 



Links 



LOCUS 
DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 



1810073M23Rik 1155 bp mRNA linear ROD 18-MAY-2003 

Mus musculus R I KEN cDNA 1810073M23 gene (181 0073M23R i k) , mRNA. 
NMJ78146 

NMJ78146. 1 G 1:30026031 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 



Craniata; Vertebrata: Euteleostomi ; 
Sciurognathi ; Muridae; Murinae; Mus. 



Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria: Rodentia; 

1 (bases 1 to 1155) 
Strausberg, R. L. , Feingold. E. A. , Grouse, L. H. , Derge, J. G. . 
Klausner, R. D. , Col I ins, F. S. . Wagner, L. , Shenmen, C. M. , Schuler, G. D. , 
Altschul,S. F. , Zeeberg, B. , Buetow, K. H. , Schaef er, C. F. , Bhat.N. K. , 
Hopkins, R. F. , Jordan, H. , Moore, T. , Max, S.I. , Wang, J. . Hsieh.F. , 
Diatchenko, L. , Marusina, K. , Farmer, A. A., Rubin, G. M. , Hong, L. , 
Stapleton, M. , Soares,M. B. , Bona I do, M. F. , Casavant, T. L. , 

Scheetz, T. E. . Brownstein. M. J. . Usdin, T. B. , Toshiyuki , S. , 
Carninci,P. , Prange, C. , Raha, S. S. , Loquel lano, N. A. , Peters, G. J. , 
Abramson, R. D. , Mu 1 1 ahy, S. J. , Bosak, S. A. , McEwan, P. J. , 
McKernan, K. J. , Malek r J. A. , Gunaratne. P. H. , Richards, S. , 
Worley.K.C.. Hale.S., Garcta,A.M., Gay,L.J., Hulyk.S.W., 
Villalon.D.K.. Muzny, D. M. , Sodergren. E. J. . Lu, X. , Gibbs.R.A., 
Fahey, J. , Helton. E., Ketteman, M. . Madan, A. , Rodrigues.S. , 
Sanchez, A. , Wh i t i ng, M. , Madan. A. , Young, A. C. , Shevchenko, Y. , 
Bouffard.G. G. . B lakes ley. R. W. , Touchman, J. W. . Green. E.D.. 
Dickson, M. C. , Rodriguez. A. C. , Grimwood, J. , Schmutz.J. , Myers, R. M. , 
Butterfield, Y.S. , Krzywinski. M. I. , Skalska ( U. f Smai I us. D. E. , 
Schnerch, A. , Schein. J. E. . Jones, S. J. and Marra. M. A. 
Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 

22388257 

12477932 

2 (bases 1 to 1155) 

Okazaki . Y. , Furuno, M. , Kasukawa, T. , Adachi f J. , Bono, H. , Kondo. S. , 
Nikaido, I.. Osato.N., Saito.R., Suzuki, H. , Yamanaka, I . , 
Kiyosawa, H. , Yagi.K. , Tomaru. Y; , Hasegawa, Y. , Nogami.A. , 
Schonbach. C. , Gojobori.T. , Baldarel I i , R. , Hi 1 1, D. P., Bult, C. , 
Hume, D. A., Quackenbush. J. . Schriml, L. M. , Kanapin, A. , Matsuda, H. , 
Batalov. S. , Beisel.K.W.. Blake, J. A.. Bradt, D. , Brusic.V.. 
Chothia, C. . Corbani , L. E. . Cousins, S. , Dal la, E. , Dragani , T. A. , 
Fletcher, C. F. . Forrest, A., Frazer, K. S. , Gaaster land, T. , 
Gariboldi, M. , Gissi.C, Godzik.A., Gough. J. , Gr immond. S. . 
Gustincich, S. , Hirokawa, N. , Jackson, I. J. , Jarvis, E. D. , Kanai.A. , 
Kawaji.H. . Kawasawa, Y. , Kedzi erski , R. M. , King, B. L. , Konagaya, A. , 
Kurochkin. I. V. , Lee,Y., Lenhard. B. , Lyons, P.A., Maglott, D. R. , 
Maltais, L. , March i onni, L. , McKenzie, L. , Miki.H. , Nagashima, T. , 
Numata, K. , Okido, T. , Pavan, W. J. , Pertea, G. . Peso I e, G. , 
Petrovsky, N. , Pillai.R., Pontius, J. U. , Gi.D. , Ramachandran, S. , 
Ravasi.T., Reed, J. C. , Reed, D. J. . Reid. J. , Ring, B. Z. , Ringwald.M., 
Sandelin, A. . Schneider, C. . Semple, C. A. , Setou, M. . Shimada, K. , 
Sultana, R. , Takenaka.Y. , Taylor, M.S., Teasdale, R. D. , Tomita,M. , 
Verardo. R. . Wagner, L. , Wahlestedt, C. . Wang. Y. , Watanabe. Y. , 
Wei Is. C. . Wi Iming. L. G. , Wynshaw-Bor i s. A: , Yanagi sawa, M. , Yang, I., 
Yang, L. , Yuan. Z. , Zavolan, M. , Zhu, Y. . Zimmer.A. . Carninci.P.. 
Hayatsu, N. . Hi rozane-Kishikawa, T. , Konno, H. . Nakamura.M. . 
Sakazume, N. . Sato, K. , Shiraki.T. , Waki.K. , Kawai.J. , Aizawa.K. , 
Arakawa, T. , Fukuda, S. . Hara, A. , Hashizume.W. . Imotani.K. . Ishii.Y. . 
Itoh.M. . Kagawa, I.. Miyazaki.A. . Sakai.K. , Sasaki, D. , Shibata, K. . 
Shinagawa, A. , Yasunishi, A. . Yoshino.M. . Waterston, R. , Lander. E. S. , 
Rogers, J.. Birney,E. and Hayashi zaki, Y. 

Analysis of the mouse transcr iptome based on functional annotation 
of 60,770 full-length cDNAs 
Nature 420 (6915), 563-573 (2002) 

22354683 ... 

12466851 

3 

The FANT0M Consortium and the RIKEN Genome Exploration Research 
Group Phase I & 1 1 Team. 

Analysis of the mouse transcr iptome based on functional annotation 
of 60,770 full-length cDNAs 
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JOURNAL Nature 420, 563-573 (20od^P 
COMMENT PROVISIONAL REFSEQ : This record has not yet been subject to final 

NCBI review. The reference sequence was derived from AK075807. 1 . 
FEATURES Locat ion/Qua I i f i ers 

source 1..1155 

/organ ism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57BL/6J" 
/db_xref ="taxon : 1 0090" 
/chromosome=~1 9" 
gene 1 . . 1 1 55 

/gene="1810073M23Rik" 

/note="synonyms: SMBP. Tm9sf3. 281 0031 D16Rik" 
/db.xref ="Locus I D : 107358 " 
/db xref="MGI: 1914262 " 
CDS 1..1155 

/gene="1810073M23Rik" 

/note="putativeunknown EST (GB|BG073434. evidence: BLASTN, 
991 match=803); SM-11044 binding protein" 
/codon_start=1 

/product="transmembrane protein 9 superfami ly member 3" 
/protei n id=" NP 835359. 1 " 
/db_xref="G I: 30026032" 
/db.xref ="Locus I D : 107358 " 
/db xref="MGI: 1914262 " 

/trans I ati on="MRPLPGAPGVAAAAALLLLLLPRARSOEHEHTYQDKEEVVLWMN 
TVGPYHNRQETYKYFSLPFCVGSKKS I SHYHETLGEALQGVELEFSGLD I KFKDDVMP 
GTYCE I DLDKEKRDAFVYA I KNHYWYQMY I DDLP I WG I VGEADENGEDYYLWTYKKLE 
I GFNGNR I VDVNLTSEGKVKLVPNTK IQMSYSVKWKKSDVKFEDRFDKYLDPSFFQHR 
I HWFS 1 FNSFMMV I FLVGLVSM I LMRTLRKDYARYSKEEEMDDMDRDLGDEYGWKOVH 
GDVFRPSSHPL I FSSL I GSGCQ 1 FAVSL I V 1 1 VAM I EDLYTERGSMLSTA I FVYAATS 
PVNGYFGGSLYARQGGRRW I KDVYRG I PYPSYGVWHC I LHOLYSHLLPCL" 
misc_feature 1 60. . 1 071 

/gene="1810073M23Rik" 

/note="EMP70; Region: Endomembrane protein 70" 

/db_xref ="CDD : pfam02990 " 
BASE COUNT 340 a 199 c 278 g 338 t 
ORIGIN 

1 atgaggccgc tgccgggcgc tcctggcgtg gcggcggccg ccgcgttgtt gttgctgctg 
61 ctgccccggg ctcggtctga cgagcacgaa cacacgtatc aagataaaga agaagttgtt 

121 ttatggatga atactgttgg gccctaccac aatcgccaag aaacatataa atacttttca 

181 cttccattct gtgtggggtc aaaaaaaagt atcagtcatt accatgaaac tttgggagaa 

241 gcacttcaag gagttgaatt agaatttagt ggcctggaca ttaaatttaa agatgatgtg 

301 atgccaggca cttactgtga aattgactta gataaagaga agagagatgc gtttgtgtat 

361 gctatcaaga atcactattg gtatcagatg tacatagatg acttaccaat atggggtatt 

421 gttggtgaag cagatgaaaa tggggaagat tactatcttt ggacctacaa aaaacttgaa 

481 ataggcttta atggaaatcg aattgttgat gttaatctaa ctagtgaagg aaaggtcaag 

541 ctggttccaa atactaaaat acagatgtca tattcggtaa aatggaaaaa atcggatgta 

601 aaatttgaag atcgattcga taaatatctt gatccatcct tttttcagca taggattcac 

661 tggttttcaa tttttaattc cttcatgatg gtgatcttct tagtgggatt agtttcaatg 

721 attttaatga gaactttaag gaaagattat gcccgataca gtaaagaaga agaaatggat 

781 gacatggaca gagacctagg agacgagtat ggctggaagc aggtgcatgg agatgtgttc 

841 agaccgtcaa gtcaccctct gatcttctcc tccctcattg gctctggatg tcagatattt 

901 gctgtgtctc tcattgttat tattgttgcc atgatagagg acttatatac agagaggggg 

961 tcaatgctca gcacagccat atttgtctat gctgctacat cgccagtgaa cgggtacttt 
1021 ggaggaagtc tgtatgctag acaaggagga aggagatgga taaaggatgt ttatcggggc 
1081 attccttatc ccagctatgg tgtgtggcac tgcattcttc atcaacttta tagccattta 
1141 ttaccatgcc tctag 

// 
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□ 1: AF116347. Homo sapiens SM-1 ...[gi:6650721] 



Links 



LOCUS 
DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



AF116347 1804 bp mRNA linear PRI 16-0CT-2001 

Homo sapiens SM-1 1044 binding protein mRNA, partial cds. 
AF1 16347 

AF1 16347.1 Gl :6650721 



Euteteostomi ; 
Homo. 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

FEATURES 
source 



CDS 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa: Chordata; Craniata: Vertebrata: 
Mammalia; Eutheria; Primates: Catarrhini; Hominidae; 

1 (bases 1 to 1804) 

Sugasawa, T., Lenzen, G. , Simon, S. , Hidaka, J. , Cahen. A. , 
Qui I laume, J. L. , Camoin,L., Strosberg, A. D. and Nahmias, C. 
The i odocyanop i ndo I o I and SM-1 1044 binding protein belongs to the 
TM9SF mu Iti spanning membrane protein superfami ly 
Gene 273 (2), 227-237 (2001) 
21479397 
11595169 

2 (bases 1 to 1804) 

Sugasawa. T. . Lenzen, G. , Simon, S. . Hidaka. J., Cahen, A. . 
Gui I laume, J. L. , Camoin, L. , Nahmias, C. and Strosberg, A. D. 
Direct Submission 

Submitted (23-DEC-1998) Vet i gen, 21 rue Sebastien Mercier, Paris 
75015, France 

Locat i on/Qua I i f i ers 
1..1804 

/organism="Homo sapiens" 
/mol_type="rnRNA" 
/db_xref="taxon:9606" 
/t i ssue_type=" r ske I eta I muse I e" 

<1. . 1737 

/note="contains nine potential membrane spanning domains; 
similar to veast EMP70 protein precursor encoded by 
GenBank Accession Numbers X67316 and S25110" 
/codon_start=1 

/product="SM-11044 binding protein" 
/prote i n_i d=" AAF21 983. 1 " 
/db_xref="G I : 6650722" 

/trans I at i on="AAAALWLLLLLLPRTRADEHEHTYODKEEVVLWMNTVGPYHNRO 
ETYKYFSLPFCVGSKKS I SHYHETLGEALQGVELEFSGLD I KFKDDVMPATYCE I DLD 
KEKRDAFVYA I KNHYWYQMY I DDLP I WG I VGEADENGEDYYLWTYKKLE I GFNGNR I V 
DVNLTSEGKVKLVPNTK I QMSYSVKWKKSDVKFEDRFDKYLDPSFFOHR I HWFS I FNS 
FMMV I FLVGLVSM I LMRTLRKDYARYSKEEEMDDMDRDLGDEYGWKQVHGDVFRPSSH 
PL I FSSL I GSGCO I FAVSL I V 1 1 VAM I EDLYTERGSMLSTA I FVYAATSPVNGYFGGS 
LYARQGGRRW I KQMF I GAFL I PAMVCGTAFF INF I A I YYHASRA I PFGTMVAVCC I CF 
FV I LPLNLVGT I LGRNLSGQPNFPCRVNAVPRP I PEKKWFMEPAV I VCLGG I LPFGS I 
F I EMYF I FTSFWAYK I YYVYGFMMLVLV I LC I VTVCVT I VCTYFLLNAEDYRWQWTSF 
LSAASTA I YVYMYSFYYYFFKTKMYGLFQTSFYFGYMAVFSTALG I MCGA I GYMGTSA 
FVRKIYTNVKID" 

BASE COUNT 509 a 301 c 403 g 591 t 

ORIGIN 

1 gcggccgccg cgctgtggct getgetgetg ctgctgcccc ggacccgggc ggacgagcac 
61 gaacacacgt atcaagataa agaggaagtt gtcttatgga tgaatactgt tgggccctac 
121 cataategtc aagaaacata taagtacttt tcacttccat tctgtgtggg gtcaaaaaaa 
181 agtatcagtc attaccatga aactctggga gaagcacttc aaggggttga attggaattt 
241 agtggtctgg atattaaatt taaagatgat gtgatgccag ccacttactg tgaaattgat 
301 ttagataaag aaaagagaga tgcatttgta tatgecataa aaaatcatta ctggtaccag 
361 atgtacatag atgatttacc aatatggggt attgttggtg aggctgatga aaatggagaa 
421 gattactatc tttggaccta taaaaaactt gaaataggtt ttaatggaaa tcgaattgtt 
481 gatgttaatc taactagtga aggaaaggtg aaactggttc caaatactaa aatccagatg 
541 tcatattcag taaaatggaa aaagtcagat gtgaaatttg aagatcgatt tgacaaatat 
601 ettgatcegt ccttttttca acateggatt cattggtttt caattttcaa ctccttcatg 
661 atggtgatct tcttggtggg cttagtttca atgattttaa tgagaacatt aagaaaagat 
721 tatgeteggt acagtaaaga ggaagaaatg gatgatatgg atagagacct aggagatgaa 
781 tatggatgga aacaggtgea tggagatgta tttagaccat caagtcaccc actgatattt 
841 tcctctctga ttggttctgg atgtcagata tttgctgtgt ctctcatcgt tattattgtt 
901 gcaatgatag aagatttata tactgagagg ggatcaatgc tcagtacagc catatttgtc 
961 tatgetgeta cgtctccagt gaatggttat tttggaggaa gtctgtatgc tagacaagga 
1021 ggaaggagat ggataaagca gatgtttatt ggggcattcc ttatcccagc tatggtgtgt 
1081 ggcactgcct tcttcatcaa tttcatagee atttattacc atgcttcaag agccattcct 
1141 tttggaacaa tggtggccgt ttgttgcatc tgtttttttg ttattcttcc tctaaatctt 
1201 gttggtacaa tacttggccg aaatctgtca ggtcagccca actttccttg tcgtgtcaat 
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1261 gctgtgcctc gtcctatacc ggag a 

1321 ctgggtggaa ttttaccttt tggttcaatc 

1381 ttctgggcat ataagatcta ttatgtctat 

1441 tgcattgtga ctgtctgtgt gactattgtg 

1501 taccggtggc aatggacaag ttttctctct 

1561 tattcctttt actactattt tttcaaaaca 

1621 tactttggat atatggcggt atttagcaca 

1681 tacatgggaa caagtgcctt tgtccgaaaa 

1741 cccaagaaaa cctggaactt tggatcaatt 

1801 aaaa 



tggttcatgg agcctgcggt tattgtttgc 
tttattgaaa tgtatttcat cttcacgtct 
ggcttcatga tgctggtgct ggttatcctg 
tgcacatatt ttctactaaa tgcagaagat 
gctgcatcaa ctgcaatcta tgtttacatg 
aagatgtatg gcttatttca aacatcattt 
gccttgggga taatgtgtgg agcgattggt 
atctatacta atgtgaaaat tgactagaga 
tctttttcat aggggtggaa cttgcacagc 
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Details 



O 1: NM.067729. Caenorhabditis el...[gi:1 7543267] 



Links 



LOCUS 4C515 1743 bp mRNA linear INV 22-N0V-2002 

DEFINITION Caenorhabditis elegans transmembrane protein TM9SF3 (66.6 kD), 
mRNA. 

ACCESSION NM.067729 

VERS I ON NM.067729. 1 G 1 : 1 7543267 

KEYWORDS 

SOURCE Caenorhabditis elegans 
ORGAN I SM Caenorhabditis elegans 

Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; 
Rhabditoidea; Rhabdi tidae; Peloderinae: Caenorhabditis. 
COMMENT VALIDATED REFSEQ : This record has undergone preliminary review of 
the sequence, but has not yet been subject to final NCBI review. 
The reference sequence was derived from WormBase CDS:Y41D4A. 4. 
Fully supported by cDNA from the Worm Transcriptome Project. 
COMPLETENESS : full length. 
FEATURES Locat i on/Qua I i f i er s 

source 1..1743 

/organism="Caenorhabditis elegans" 
/mol_type="mRNA" 
/db_xr ef ="taxon : 6239" 
/chromosome="IV" 

/map="IV;-15.38 cM (interpolated genetic position)" 
/map="IV; covering 5823 bp. from base 1745879 to 1751703 
on genome release WS85" 

/clone="Primers to amplify the CDS (Stop included): 
ATGCGGGTACTGATAAGCC (T=57. 6) , 
TTAATCGATTTTCACAGTCTGATAGATTT (T=56. 4) " 
/clone_l ib=" [Predicted cellular localisation] PSORT 1 1 
analysis, (K. Nakat) trained on yeast data predicts that 
the subcellular location of this protein is most likely in 
the endoplasmic reticulum (66%) or in the plasma membrane 
(33%) . " 

gene 1..1743 

/gene="4C515" 
/locus_tag="4C515" 

/note="synonyms: Y41D4A.4, CELK01378" 
CDS 1..1743 

/gene="4C515" 
/IocusJ:ag="4C515" 

/f unct i on="Tr anspor ter GO : 000521 5 [Pf am/ 1 nt er pro/GO] " 
/f unct i on="Tr ansport GO : 000681 0 [Pf am/ 1 nter pro/GO] " 
/function^" Integra I membrane protein GO: 001 6021 
[Pfam/lnterpro/GO]" 

/note="|| [Common ancestor, from TaxBlast with threshold 
10-3] Eukaryota || [Closest homo logs] D i ctyoste I i urn 
discoideum: emb|CAC47950. 1 1 (score=271); Arabidopsis 
thaliana: ref|NP 563881. 1 | (528); Saccharomyces 
cerevisiae: ref|NP 010392. 1 1 (211): Schi zosaccharomyces 
pombe: ref|NP 596464. 1 1 (177); Caenorhabditis elegans: 
ref|NP 500130.1| (1028); Anopheles gambiae str PEST: 
gb|EAA09712. 1| (724); Drosophi la melanogaster : ref|NP 
647979.1| (728): Mus tnusculus: ref|NP 579930. 1 1 (760); 
Homo sapiens: ref|XP 050993. 1 1 (761)" 
/codon_start=1 

/product="Transmembrane protein TM9SF3 (66.6 kD)" 

/protein_id=" NP 500130.1 " 

/db.xref ="G I : 1 7543268" 

/db.xref ="AceVi ew/WormGenes : 4C515 " 

/db_xref ="CDD : pfam02990 " 

/db_x ref ="Locus I D : 176983 " 

/db_x r ef ="NextDB : CELK01378" 

/db.xref ="WormBase : Y41D4A.4 " 

/trans I at i on="MRVLI SLLLFTSLLVEADEHDHLYEVDEEVVLWMNTVGPYSNRO 
ETYTYFSLPFCKGEKKEIGHYHETMGESLLGVELEFSGLD I KFRTNTKKTWCEKKLT 
ETDYKTLLYA I QNSYYYQMYLDDMP I WGMVGE I DNTVNPPAYKLYTHKRLD I GYNDKQ 
VVDVNLTTDGRVD I RPGAELTFTYEVQWSKSEVDFTKRFDKYLDPSFFQHR I HWFS I F 
NSFMMWFLVGLVWM I LVRTLRKDYARYQKEDSLDDLDADLGDEYGWKOVHGDVFRPP 
SMPLLFSSC I GAGYHVFTVAV I TT I LA I VGEFYTERGSLLSAA I FVYAACSPVNGY AG 
GSMYARFGGRHW I KQMLLGAFLLPSMVCGVAFL I NF I A I YYHASRA I PFT I MLAVSA I 
CLFV I LPLTLVGTV I GRNMAGSADYPCRVNAVPRP I PDKKWFVQPWL I TLAGGVLPFG 
S I F I EMYF I FTSFWAYK I YYVYGFMLLVT 1 1 LS I VTVCVT I VCSYFLLNAEDYRWRWT 



http://www.ncbi.nlm.nih.gov/entrez/query .fcgi?cmd=Retrieve&db=Nucleotide&list_uids=1 7S43267&... 2003/05/3C 



NCBI. Sequence Viewer 



2/3 ^— v 



FKTKMYGLFOTVFYFGYMG I FAAALGLMTGT I GYVGT 



SFACGASTAFYVYMYl 
AKFVRKIYQTVKID" 
misc feature 1..51 

/gene^CSIS* 
/locus_tag="4C515" 

/note="Reg i on : [PSORT] N terminal signal domain" 
misc feature 133.. 1614 

/gene="4C515" 
/locus_tag="4C515" 

/note="Region: [Pfam/ inter Pro description] endometnbrane 
protein 70: the transmembrane 9 super f ami I y protein 
(TM9SF) may function as a channel or small molecule 
transporter. Proteins in this group are endosomal integral 
membrane proteins. HMMER score 820.10 Evalue=5. 1e-244" 
/db xref=~CDD: pfam02990 ~ 
misc_feature 658. . 708 

/gene="4C515~ 
/locus_tag="4C515" 

/note="Region: [PSORT] transmembrane domain: 
SFMMVVFLVGLVWM I L\T 
misc_feature 874. . 924 

/gene="4C515" 
/locus_tag="4C515 ,r 

/not e=" Reg ion: [PSORT] transmembrane domain: 
VFTVAV I TT I LA I VGEF" 
mi sc_feature 937. . 987 

/gene=~4C515" 
/locus_tag="4C515" 

/note="Reg i on : [PSORT] transmembrane domain: 
GSLLSAA I FVYAACSPV" 
mi sc_feature 1054. .1104 

/gene="4C515" 
/locus_tag="4C515" 

/note="Region: [PSORT] transmembrane domain: 
LGAFLLPSMVCGVAFLT 
misc_feature 1159. .1209 

/gene="4C515" 
/locus_tag="4C515" 

/note="Region: [PSORT] transmembrane domain: 
LAVSA I CLFV I LPLTLV 
misc feature 1312. .1362 

/gene="4C515" 
/locus_tag="4C515" 

/note="Region: [PSORT] transmembrane domain: 
WL I TLAGGVLPFGS I F I " 
mjgr. ffiatiire 1426. . 1476 

/gene="4C515" 
/locus.tag="4C515" 

/note="Region: [PSORT] transmembrane domain: 
LLVTI1LSIVTVCVTIV" 
misc feature 1 726. . 1 737 

/gene^CSIS" 
/locus_tag="4C515~ 

/note^Region: [PSORT] ER membrane domain: TVKT 
BASE COUNT 442 a 428 c 404 g 469 t 
ORIGIN 

1 atgcgggtac tgataagcct gctactattc acctcgctgc tcgtcgaagc cgacgaacac 
61 gaccatttgt acgaggtcga cgaggaggtt gtattatgga tgaacacggt gggaccctac 
121 tcgaatcgcc aggaaaccta cacctacttc tccctgccat tttgtaaagg agagaagaag 
181 gaaattggcc attatcacga aactatggga gaatcactgc tcggcgtgga gctagagttc 
241 tccggattgg atataaaatt cagaacaaac accaagaaaa ccgtggtttg cgagaagaag 
301 ctcaccgaaa ccgactataa aacgctgctc tacgctatcc aaaacagcta ttactatcaa 
361 atgtacctcg acgacatgcc gatttgggga atggttggag aaatcgataa tactgtgaat 
421 ccgccggcct acaagcttta tacgcataag aggctagata ttggctataa cgataagcag 
481 gtggtcgacg tgaatttgac aacagacgga agagttgata ttcgtcccgg cgctgagctc 
541 acgttcacct atgaggtgca atggagcaag tcggaagtgg attttaccaa gagattcgat 
601 aaatatctcg atccaagctt cttccagcac agaatccact ggttctcgat ttttaacagt 
661 ttcatgatgg tcgtattcct cgtcggattg gtatggatga ttctagtccg tactctacga 
721 aaagactacg cccgatacca aaaagaggat agtctcgacg atttggacgc cgatctcgga 
781 gatgagtacg gttggaaaca ggttcacgga gacgtcttcc gtccaccatc aatgccgttg 
841 ctcttctcat cgtgtattgg agccggttat catgtgttca ctgtcgcggt tattacgacg 
901 attttggcga ttgttggaga attttataca gaacgcggct ccctactctc cgccgcaatc 
961 ttcgtctacg ccgcttgctc gccagtcaac ggatatgccg gcggatcgat gtacgccaga 
1021 ttcggaggtc gtcactggat caagcaaatg ctcctcggag cattccttct gccaagtatg 
1081 gtgtgtggtg tcgcattcct tatcaacttt atcgcgatct actatcacgc gtcacgcgcc 
1141 attccattca caattatgct cgccgtctcg gcgatctgcc tctttgtgat ccttccgctg 
1201 actcttgtcg gaactgtgat cggaagaaat atggctggaa gtgcagacta tccgtgtaga 
1261 gtgaatgccg tgccgagacc cattcctgat aagaaatggt tcgtccaacc gtggctcatc 
1321 acccttgccg gaggtgttct cccattcgga tccatcttca tcgaaatgta ctttatcttc 
1381 acctcattct gggcgtacaa aatctattat gtgtacggat tcatgctttt ggtcacaata 
1441 atcctgtcaa ttgtcaccgt atgcgtcaca attgtctgct cctacttcct actcaacgcc 
1501 gaggattata gatggagatg gacgagcttc gcgtgtggag catctactgc attctatgtg 
1561 tacatgtatt ccatgtatta tttcttcttc aagacaaaaa tgtacggact tttccaaacg 
1621 gtcttctact ttggatacat gggaatcttc gccgccgctc tcggactcat gaccggcacc 
1681 atcggatacg tcggaaccgc caaattcgtc cgcaaaatct atcagactgt gaaaatcgat 
1741 taa 
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□ 1: BC046021. Danio rerio, Simi...[gi:28278504] 



LOCUS BC046021 2082 bp mRNA linear VRT 10-FEB-2003 

DEFINITION Danio rerio. Similar to expressed sequence AW1 461 16, clone 

MGC: 56246 I MAGE: 5601 580. mRNA. complete cds. 
ACCESSION BC046021 
VERSION BC046021. 1 Gl : 28278504 
KEYWORDS MGC. 

SOURCE Danio rerio (zebrafish) 
ORGAN I SM Danio rerio 

Eukaryota: Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Actinopterygi i ; Neopterygi i : Teleostei ; Ostariophysi ; 
Cyprini formes; Cyprinidae; Danio. 
REFERENCE 1 (bases 1 to 2082) 
AUTHORS Strausberg. R. 
TITLE Direct Submission 

JOURNAL Submitted (31 - JAN-2003) National Institutes of Health, Mammalian 
Gene Collection (MGC), Cancer Genomics Office, National Cancer 
Institute, 31 Center Drive. Room 11A03. Bethesda. MD 20892-2590. 
USA 

REMARK NIH-MGC Project URL: http://mgc.nci.nih.gov 
COMMENT Contact: MGC help desk 

Email: cgapbs-r@mail.nih.gov 

Tissue Procurement: Dr. Sumio Sugano 

cDNA Library Preparation: Dr. Sumio Sugano 

cDNA Library Arrayed by: The I.M.A. G. E. Consortium (LLNL) 

DNA Sequencing by: Sequencing Group at the Stanford Human Genome 

Center, Stanford University School of Medicine, Stanford, CA 94305 

Web site: http://www-shgc. Stanford, edu 

Contact: (Dickson. Mark) mcd@paxil.stanford.edu 

Dickson, M. , Schmutz, J.. Grimwood. J., Rodriquez, A. t and Myers, 

R. M. 

Clone distribution: MGC clone distribution information can be found 
through the I.M. A. G. E. Consortium/LLNL at: http:// image. I lnl.gov 
Series: IRAK Plate: 103 Row: I Column: 14. 
FEATURES Locat i on/Qua I i f i ers 

source 1..2082 

/organ ism=*Danio rerio* 

/mol_type=*mRNA* 

/db_xref =*taxon : 7955" 

/clone=*MGC:56246 I MAGE: 5601 580* 

/tissue_type=*Whole body, adult male* 

/c I one_ 1 i b="Sugano_S JD_adu I t_ma I e" 

/lab_host=*DH10B* 

/note=*Vector : pMEl 8S-FL3" 
CDS 98.. 1858 

/codon_start=1 

/product=*Simi lar to expressed sequence AW146116* 
/protei n_i d=" AAH46021 . 1 * 
/db_xref="G I: 28278505* 

/trans I at i on=*MGLCKWTFEWALTAVFSSF I PVRADEHEHKYTDKEEVVLWMNT 
VGPYHNRQETYKYFSLPFCVGTKKT I SHYHETLGEALQGVELEFSGLD I KFKEEVLQT 
TYCD I ELDKPKRDAFVYA I KNHYWYQMY I DDLP I WG I VGEADENGEDYYLWTYKKLE I 
GHNGNR I VDVNLTSEGKVKLVPNTR I PMSYSVKWKKSDVKFEDRFDKYLDPSFFOHR I 
HWFS I FNSFMMV I FLVGLVSM I LMRTLRKDYARYSKEEEMDDMDRDPGDEYGWKOVHG 
DVFRPSSHPL I FSSL I GSGCQ I FSVSL I V 1 1 LAM I EDLYTERGSMLSTA I FVYAATSP 
VNGYFGGSLYAKOGGRRW I KQMF I GAFL I PAMVCGTAFF I NF I A I YYHASRA I PFGTM 
VAVCC I CLFV I LPLNLVGT I LGRNLSGOPNFPCRVNAVPRP I PEKKWFMEPAV I VCLG 
G I LPFGS I F I EMYF I FTSFWAYK I YYVYGFMMLVLV I LC I VTVCVT I VCTYFLLNAED 
YRWQWTSFLSAASTAVYVYMYSFYYYFFKTKMYGLFQTSF YFGYMAVFSTALG I MCGA 
VGYMGTSAFVRK I YTNVK I D* 

BASE COUNT 601 a 424 c 478 g 579 t 

ORIGIN 

1 aagacaacca ctgtcataga accaggaaga aggaaggtgg gatagccccc aaaaagatac 
61 catttcaaat cagtcgcaga attttcccta gaaaaccatg gggctctgca agtggacgtt 
121 tgaagtggtt gccttgacgg ctgtttttag ttcgttcata ccagttcgag cagatgaaca 
181 cgagcacaag tacacagata aagaagaggt ggttttatgg atgaatacag tgggccctta 
241 tcacaaccga caggagacct ataagtactt ctccctaccg ttctgtgtgg gaacgaaaaa 
301 aactattagt cattatcatg agacattagg agaggcgcta cagggggttg aattggagtt 
361 cagcggtctg gacatcaagt tcaaagagga ggtgttgcag acgacatact gtgacattga 
421 actagacaaa cctaaaagag atgcatttgt ctatgccatc aagaaccact actggtacca 
481 aatgtacata gatgaccttc ccatctgggg aattgttggt gaagctgatg agaatggaga 
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541 ggactattac ctttggactt acaaa^^^ ggagatcgga cataatggga acagaattgt 

601 ggatgttaac ctaaccagtg agggcaaggt taagcttgtt ccaaacacga gaattcccat 

661 gtcatactct gtcaagtgga agaaatctga tgtgaaattt gaggatagat ttgacaaata 

721 ccttgatcca tctttctttc aacacagaat tcactggttc tcaatattca actcattcat 

781 gatggtgatt ttcctggtgg gccttgtttc aatgatcctg atgagaacat taaggaaaga 

841 ctatgccaga tacagcaaag aggaggaaat ggatgacatg gatcgtgatc ctggtgatga 

901 atatggctgg aagcaggttc acggggatgt gttcaggcca tccagtcacc cgctgatctt 

961 ctcctccctc attggctctg gctgtcaaat tttttcagtt tcccttatag tcataatttt 

1021 ggccatgatt gaggacttgt atacagagag agggtccatg ttgagtacag ccatatttgt 

1081 ttatgctgcc acgtctccag tcaatggcta ctttggtgga agcctatatg caaaacaagg 

1141 agggagaaga tggataaagc agatgttcat tggagccttc ctgattcctg caatggtgtg 

1201 tggtacagcg tttttcatca acttcatcgc catctactac catgcctcaa gagcaatccc 

1261 attcggcact atggttgcgg tgtgctgcat ctgcttattt gtaattctac ctctgaacct 

1321 ggtggggacg atcctgggca gaaacctctc cggacagccc aactttcctt gtagagtcaa 

1381 tgctgtaccc agacccatcc cagaaaagaa atggtttatg gagccagcag ttatcgtgtg 

1441 tttaggtggc atccttccat tcggctcgat attcattgaa atgtacttca ttttcacatc 

1501 tttctgggcc tacaagattt actatgtgta tggatttatg atgctggtgc tggtgatcct 

1561 gtgcatcgtt actgtctgcg tcaccattgt ctgcacctac ttccttctca atgcagaaga 

1621 ctacagatgg cagtggacaa gctttctctc tgcagcgtca acagcagttt atgtttacat 

1681 gtactccttc tactactatt tcttcaaaac aaagatgtat ggcttattcc agacgtcatt 

1741 ctactttggc tacatggctg tgttcagcac tgctctggga atcatgtgtg gagcggtggg 

1801 ttacatggga acaagtgcct ttgtgaggaa gatctacaca aatgtgaaaa tcgactaaaa 

1861 ctctgcagag acgcagcaga agagccgaca cttcctcccg atcgaaacgg gcacaaagtc 

1921 tccatttatt ttgcaagatg agattgtggt atcttgtaca aaatctagtt ttccattttc 

1981 tcaatgaatt aaacgttgcg gcatgatgca tacacaaggg taaaccgaga ggttaaataa 

2041 acaggacttt taattttaaa aaaaaaaaaa aaaaaaaaaa aa 
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TBLASTN 2.2.6 [Apr-09-2003] 
Reference : 

Altschul, Stephen F. , Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PS I -BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

RID: 1054282789-05952-1524 



Query= 



(17 letters) 



Database: All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, 
GSS, or phase 0, 1 or 2 HTGS sequences) 

1, 772, 460 sequences: 8,313,506,101 total letters 



If you have any problems or questions with the results of this search 
please refer to the BLAST FAQs 

Taxonomy reports 



Distribution of 16 Blast Hits on the Query Sequence 



jMouse-over to show defline and scores. Click to show alignments 




Sequences producing significant alignments: 

1 9755048 1 gb I AF2691 49. 1 1AF269149 Gal lus gal lus transmembra. 



1 22760523 1 db j I AK074824. 1 I Homo sapiens cDNA FLJ90343 fis. 



130026031 1 ref INM 178146. 1 1 Mus musculus RIKEN cDNA 181007. 



1 26344651 Idb i I AK075807. 1 1 Mus musculus 10 day old male pa. 



1 6650721 1 gb I AF1 1 6347. 1 1 AF1 1 6347 Homo sapiens SM-1 1044 bin. . 



17021041 Idb iiAK000756. 11 Homo sapiens cDNA FLJ20749 fis, 



1 17543267 1 ref INM 067729.11 Caenorhabdi tis elegans transme. . 



H4042725ldbilAK027788.il Homo sapiens cDNA FLJ14882 fis, .. 



1 1 8088072 1 gb I BG020959. 1 1 BCQ20959 Homo sapiens, clone MGC: 



1 9755050 i gb I AF2691 50. 1 [AF269150 Homo sapiens transmembran. 



1 28278504 1 gb I BC046021 . 1 1 Danio rerio. Similar to expresse. 



119111161 Iref INM 133352. 1 1 Mus musculus RIKEN cDNA 181007. 



1 1 343591 4 1 gb I BC004799. 1 1 Mus musculus RIKEN cDNA 1810073M. 



1 9755052 1 eb I AF2691 51 . II AF2691 51 Mus musculus transmembran.. 



!24658498lreflNM 139722.21 Drosophi la melanogaster CG1059. . 



H7945956|gblAY071401.1l Drosophi la melanogaster RE48767 .. 



Score 
(bits) 

35 

35 

35 

35 

35 

35 

35 

35 

35 

35 
35 

35 

35 

35 

34 

34 



E 

Value 

2.7 

2.7 

2.7 

2.7 

2.7 

2.7 

2.7 

2.7 

2.7 

2.7 
2.7 

2.7 

2.7 

2.7 

3.6 

3.6 



A I ignments 
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n > gi 1 9755048 1 gb I AF269 1 49. 1 IAF269149 Gallus gallus transmembrane protein TM9SF3 (TM9SF3) mRNA, partial 
cds 

Length = 234 

Score = 34. 7 bits (78), Expect =2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +1 

Query: 1 DPXFFQHR I HVFS I FN 16 

DP FFQHR I H FSIFN 
Sbjct: 112 DPSFFQHR I HWFS I FN 159 



O > g i 1 22760523 Idbil AK074824. 1 1 H Homo sapiens cDNA FLJ90343 fis. clone NT2RP2002824, weakly similar 
to ENDOSOMAL P24A PROTEIN PRECURSOR 
Length = 2864 

Score = 34.7 bits (78), Expect = 2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +1 

Query: 1 DPXFFQHR I HVFS I FN 16 

DP FFQHR I H FSIFN 
Sbjct: 145 DPSFFQHR I HWFS I FN 192 



□ > gt 130026031 Iref INM 178146. 1 1 11 Mus musculus Rl KEN cDNA 1810073M23 gene (181O073M23Rik) . mRNA 
Length =1155 

Score = 34.7 bits (78), Expect = 2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +1 

Query: 1 DPXFFQHR I HVFS I FN 16 

DP FFQHR I H FSIFN 
Sbjct: 631 DPSFFQHR I HWFS I FN 678 



n > gi 126344651 idbi IAK075807. 1 1 ill Mus musculus 10 day old male pancreas cDNA, RIKEN full-length 
enriched library, clone:l810073M23 product: unknown EST, 
full insert sequence 
Length = 2479 

Score = 34.7 bits (78), Expect = 2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +2 

Query: 1 DPXFFQHR I HVFS I FN 16 

DP FFQHR I H FSIFN 
Sbjct: 749 DPSFFQHR I HWFS I FN 796 



O > gi 1 6650721 1 gb I AF1 16347. 11 AF1 16347 iOll Homo sapiens SM-11044 binding protein mRNA, partial cds 
Length = 1804 

Score = 34.7 bits (78), Expect = 2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +1 

Query: 1 DPXFFQHR I HVFS I FN 16 

DP FFQHR I H FSIFN 
Sbjct: 604 DPSFFQHR I HWFS I FN 651 



□ > fii 17021041 Idbi IAK000756. 1 1 Mil Homo sapiens cDNA FLJ20749 fis. clone HEP05301 
Length = 2973 

Score = 34.7 bits (78), Expect = 2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +2 

Query: 1 DPXFFQHR I HVFS I FN 16 
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DP FFQHRIH FS1FN 
Sbjct: 257 DPSFFQHR I HWFS I FN 304 



n > gi 1 17543267 1 ref iNM 067729. 1 1 H Caenorhabditis eiegans transmembrane protein TM9SF3 (66.6 kD), mRNA 
Length = 1743 

Score = 34.7 bits (78) , Expect =2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame - +1 

Query: 1 DPXFFQHR I HVFS I FN 16 

DP FFQHRIH FSIFN 
Sbjct: 610 DPSFFQHR I HWFS I FN 657 



O > g i 1 1 4042725 Idbil AK027788. 1 1 IIH Homo sapiens cDNA FLJ14882 f is, clone PLACE1003537, weakly similar 
to ENDOSOMAL P24A PROTEIN PRECURSOR 
Length = 2459 

Score = 34.7 bits (78). Expect = 2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +3 

Query: 1 DPXFFQHR I HVFS I FN 16 

DP FFQHRIH FSIFN 
Sbjct: 510 DPSFFQHR I HWFS I FN 557 



□ > gi i 1 8088072 1 gb I BC020959. 1 1BC020959 HIS Homo sapiens, clone MGC:8842 I MAGE: 385001 5. mRNA, complete cds 
Length =2870 

Score = 34.7 bits (78), Expect =2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +2 

Query: 1 DPXFFQHR I HVFS I FN 16 

DP FFQHRIH FSIFN 
Sbjct: 434 DPSFFQHR I HWFS I FN 481 



□ > gi 1 9755050 1 gb I AF2691 50. 1 1AF269150 fill Homo sapiens transmembrane protein TM9SF3 (TM9SF3) mRNA, complete 
cds 

Length = 3592 

Score = 34.7 bits (78), Expect = 2.7 

Identities = 14/16 (87%). Positives = 14/16 (87%) 

Frame = +2 

Query: 1 DPXFFQHR I HVFS I FN 16 

DP FFQHRIH FSIFN 
Sbjct: 854 DPSFFQHR I HWFS I FN 901 



O > gi 1 28278504 1 gb I BC04 6021. 1 1 Danio rerio. Similar to expressed sequence AW146116, clone 
MGC: 56246 IMAGE: 5601 580, mRNA, complete cds 
Length = 2082 

Score = 34.7 bits (78). Expect = 2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +2 

Query: 1 DPXFFQHR I HVFS I FN 16 

DP FFQHRIH FSIFN 
Sbjct: 725 DPSFFQHR I HWFS I FN 772 



D> gi 119111161 IreflNW 133352.11 SB Mus musculus R I KEN cDNA 1810073M23 gene (1 810073M23R i k> . mRNA 
Length =3143 

Score = 34.7 bits (78), Expect = 2.7 

Identities = 14/16 (87%). Positives = 14/16 (87%) 

Frame = +2 

Query: 1 DPXFFQHR I HVFS I FN 16 
DP FFQHRIH FSIFN 

http://\ftww.ncbi.nlm.nih.gov/bIast/Blast.cgi 
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Sbjct: 713 DPSFFQHR I HWFS I FN 760 



H > gi H3435914lgblBC004799. 1 1 




Mus musculus Rl KEN cDNA 1810073M23 gene, mRNA (cDNA clone MGC:8136 



I MAGE: 3589474), complete cds 
Length = 2848 

Score = 34.7 bits (78), Expect = 2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +2 

Query: 1 DPXFFQHRIHVFSIFN 16 

DP FFQHRIH FSIFN 
Sbjct: 410 DPSFFQHR I HWFS I FN 457 



CI > ei 1 9755052 1 gb t AF269 1 51 . 1 1AF269151 fill Mus musculus transmembrane protein TM9SF3 (Tm9sf3) mRNA, complete 
cds 

Length = 3143 

Score = 34.7 bits (78), Expect =2.7 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +2 

Query: 1 DPXFFQHRIHVFSIFN 16 

DP FFQHRIH FSIFN 
Sbjct: 713 DPSFFQHR I HWFS I FN 760 



O > gi 1 24658498 1 ref |NM_139722. 2 1 H Drosophi la melanogaster CQ10590-PA [Drosophila melanogaster] 
(CG10590) mRNA, complete cds 
Length = 2663 

Score = 34.3 bits (77), Expect = 3.6 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame - +1 

Query: 1 DPXFFQHRIHVFSIFN 16 

DP FFQHRIH FSIFN 
Sbjct: 823 DPNFFQHR I HWFS I FN 870 



D > gi 1 1 7945956 lab I AY07 1401. 1 i MB Drosophila melanogaster RE48767 full length cDNA 
Length = 2662 

Score = 34.3 bits (77), Expect = 3.6 

Identities = 14/16 (87%), Positives = 14/16 (87%) 

Frame = +1 

Query: 1 DPXFFQHRIHVFSIFN 16 

DP FFQHRIH FSIFN 
Sbjct: 823 DPNFFQHR I HWFS I FN 870 



Database: All GenBank+EMBL+DDBJ+-PDB sequences (but no EST, STS, GSS. 
or phase 0, 1 or 2 HTGS sequences) 

Posted date: May 30. 2003 12:03 AM 
Number of letters in database: -276,428,487 
Number of sequences in database: 1.772,460 

Lambda K H 

0.345 0.155 0.557 

Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BLOSUM62 

Gap Penalties: Existence: 11, Extension: 1 
Number of Hits to DB: 186,634,521 
Number of Sequences: 1772460 
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Number of extensions: 713471 

Number of successful extensions: 3323!*^ 

Number of sequences better than 10.0: 16 

Number of HSP' s better than 10.0 without gapping: 29804 

Number of HSP' s successfully gapped in prelim test: 0 

Number of HSP' s that attempted gapping in prelim test: 0 

Number of HSP' s gapped (non-prelim): 33235 

length of query: 17 

length of database: 2,771.168.700 

effective HSP length: 0 

effective length of query: 24 

effective length of database: 2,771.168,700 

effective search space: 66508048800 

effective search space used: 66508048800 

frameshift window, decay const: 50, 0.1 

T: 13 

A: 40 

X1: 15 (7.5 bits) 
X2: 38 (14.6 bits) 
X3: 64 (24.7 bits) 
S1: 38 (21.6 bits) 
S2: 73 (32.7 bits) 
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□ 1: AF269149. Gallus gallus tra...[gi:9755048] 



Links 



LOCUS AF269149 234 bp mRNA linear. VRT 09-AUG-2000 

DEFINITION Gallus gallus transmembrane protein TM9SF3 (TM9SF3) mRNA, partial 
cds. 

ACCESSION AF269149 

VERSION AF269149. 1 Gl:9755048 

KEYWORDS 

SOURCE Gallus gallus (chicken) 
ORGANISM Gallus gallus 

Eukaryota: Metazoa; Chordata: Craniata; Vertebrata; Eute leostomi : 
Archosauria; Aves: Neognathae; Gal I i formes; Phasianidae; 
Phasianinae; Gal I us. 
REFERENCE 1 (bases 1 to 234) 
AUTHORS Gong.T.W.. Hegeman. A. D. . Shin.J.J., Adler.H.J.. Raphael. Y. and 
Lomax.M. I. 

TITLE Identification of genes expressed after noise exposure in the chick 

basi lar papi t la 
JOURNAL Hear. Res. 96 (1-2). 20-32 (1996) 
MEDLINE 96414275 
PUBMED 8817303 
REFERENCE 2 (bases 1 to 234) 
AUTHORS Warner. S. J. and Lomax.M. I. 

TITLE Evolution of the TM9 Super Fami ly of Membrane Spanning Proteins 
JOURNAL Unpublished 
REFERENCE 3 (bases 1 to 234) 
AUTHORS Warner. S.J. and Lomax. M. I. 
TITLE Di rect Submission 

JOURNAL Submitted (10-MAY-2000) Kresge Hearing Research Institute, 

University of Michigan, 9301E MSRB III. 1150 West Medical Center 
Drive. Ann Arbor. Ml 48109-0648, USA 
FEATURES Locat i on/Ou a I i f i er s 

source 1..234 

/ Or gam 5in=*Ga i i us ga i i Us" 
/mol_type=*mRNA* 
/db_xref=*taxon:9031* 
/t i ssue_type=*bas i I ar pap ilia* 
gene <1..>234 

/gene=*TM9SF3" 
CDS <1..>234 

/gene=*TM9SF3* 

/note=*transmetnbrane protein 9 superfami ly member 3* 
/codon_start=1 

/product=*transmembrane protein TM9SF3" 
/protein id=" AAF98158. 1 * 
/db_xref=*G 1:9755049* 

/trans I at i on=*ASEGKVKLVPNTK I QMSYSMKWKKSDVKFEDRFDKYLDPSFFQH 
R I HWFS I FNSFMMV I FLVGLVSN I LMRTLRKKKK* 
80 a 38 c 46 g 70 t 



BASE COUNT 
ORIGIN 

1 
61 
121 
181 

// 



gccagcgaag gaaaggtgaa actggtacca aatacaaaaa tccagatgtc atattctatg 

aagtggaaga agtcagatgt gaaatttgaa gaccgatttg acaagtacct tgacccatct 

ttctttcagc acagaattca ctggttttca attttcaatt ctttcatgat ggtgatcttt 

ctggttggcc tagtgtctat gatattaatg agaactttgc gaaaaaaaaa aaaa 
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Details 



□ 1: NMJ78146. Mus musculus RIKE...[gi:30026031] 



LOCUS 

DEFINITION 
ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINt 
PUBMED 
REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 



1810073M23Rik 1155 bp mRNA linear ROD 18-MAY-2003 

Mus musculus RIKEN cDNA 1810073M23 gene (181 0073M23R i k) , mRNA. 
NMJ78146 

NMJ 781 46.1 61 : 30026031 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria: Rodentia; Sciurognathi : Muridae; Murinae; Mus. 
1 (bases 1 to 1155) 



Strausberg, R. 
Klausner, R. D. 
Altschul.S.F. 
Hopkins. R. F. , 
D i atchenko, L. 
Stap I eton, M. , 
Scheetz, T. E. , 
Carninci, P. , 
Abramson, R. D. 
McKernan. K. J. 
Worley. K. C. , 



L. , Feingold, E. A. , Grouse, L. H. , Derge, J. G. 



Coll ins, F.S., Wagner, L 
, Zeeberg, B. , Buetow, K. H. , 

Jordan, H. . Moore, T. . Max, 
, Marusina, K. , Farmer, A. A. 

Soares, M. B. . Bonaldo. M. F. 

Brownste i n, M. J. . Usd i n. T 
Prange, C. , Raha, S. S. , Loquel lano, N. A. , 
, Mul lahy.S. J. . Bosak, S.A. , McEwan, P. J. , 
. Malek.J. A. , Gunaratne, P. H. , Richards, S. , 
Hale.S. , Garcia.A.M. f Gay.L.J.. Hulyk.S.W. 



Shenmen, C. M. , Schu I er . G. D. 
Schaefer.C.F. . Bhat,N.K. . 
S. I., Wang, J. , Hsieh.F. , 
Hong, L. , 
L. 
S. . 

Peters, G. J. , 



Rubin, G. M. 
Ca savant, T. 
. B. , Toshiyuki 



Vi llalon,D.K., Muzny, D. M. , Sodergren, E. J. , Lu, X. , Gibbs,R.A., 
Fahey, J. , He I ton, E. , Ketteman, M. , Madan, A. , Rodr i gues, S. , 
Sanchez, A. , Whiting, M. , Madan, A., Young, A. C. , Shevchenko, Y. . 
Bouf f ard. G. G. , B lakes ley, R. W. , Touchman, J. W. , Green, E. D. , 
Dickson, M. C. , Rodri guez, A. C. , Grimwood, J. , Schmutz.J. , Myers. R.M. 
Butterfield, Y.S. , Krzywinski , H. I. , Skalska.U., Smai lus, D. E. , 
Schnerch.A. , Schein, J. E. , Jones. S.J. and Marra, M. A. 
Generation and initial analysis of more than 15.000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26). 16899-16903 (2002) 
223882b/ 
12477932 
2 (bases 1 
Okazaki.Y., 
Nikaido. I. , 
Kiyosawa, H. 



to 1155) 
Furuno. M. 
Osato, N. , 
Yagi.K. , 
Schonbach, C. , Go jobor i , T. , 
Hume, D. A. , Quackenbush, J. . 



Kondo, S. 



Kasukawa. T. , Adach i , J. , Bono, H. 
Saito, R. , Suzuki, H. , Yamanaka, I., 
Tomaru, Y. f Hasegawa, Y. , Nogatni , A. , 

Baldarelli.R., Hi 1 1, D. P.. Bult, C. , 
Schr iml, L M. , Kanapin, A. , Matsuda, H. , 



Batalov.S., Beisel.K.W.. Blake. J. A., Bradt, D. . Brusic.V.. 
Chothia, C. , Corbani , L. E. , Cousins. S. , Dal I a. E. , Dragani . T. A. , 
F I etcher. C. F. , Forrest, A. , Frazer. K. S. , Gaaster I and. T. , 
Gar iboldi . M. t Gissi.C. , Godzik, A. , Gough. J. , Gr immond, S. , 
Gustincich, S. , Hirokawa.N. , Jackson, I. J. , Jarvis, E. D. . Kanai.A. , 
Kawaji.H. , Kawasawa, Y. , Kedzi erski . R. M. . King, B. L. . Konagaya, A. , 
Kurochkin, I. V. , Lee. Y. , Lenhard. B. , Lyons, P. A.. Maglott. D. R. , 
Maltais.L. . March i onni . L. , McKenzie.L. , Miki.H. , Nagashima, T. , 
Numata, K. . Okido, T. , Pavan, W. J. , Pertea, G. , Pesole.G. , 
Petrovsky. N. , Pillai.R. , Pontius, J. U. . Qi.D. . Ramachandran. S. . 
Ravasi,T. . Reed, J. C. , Reed, D. J. , Reid.J. , Ring, B. Z. . Ringwald,M. , 
Sandel in, A. , Schneider, C. , Semple, C. A. . Setou, M. , Shi mad a, K. , 
Sultana. R. , Takenaka, Y. , Taylor, M.S., Teasdale, R. D. , Tomita.M. , 
Verardo.R., Wagner. L. , Wahlestedt. C. , Wang, Y. . Watanabe. Y. , 
Wei Is, C. , Wi Iming, L. G. , Wynshaw-Boris, A. , Yanagi sawa, M. , Yang, I., 
Yang, L. , Yuan. Z. . Zavolan, M. , Zhu, Y. . Zimmer.A. , Carninci, P., 
Hayatsu. N. . Hirozane-Kishikawa, T. . Konno. H. , Nakamura. M. . 
Sakazume, N. . Sato. K. , Shiraki.T. , Waki.K. , Kawai,J. , Aizawa. K. . 
Arakawa. T. . Fukuda. S. , Hara. A. . Hashizume.W. , Imotani.K. , Ishii.Y. , 
Itoh, M. , Kagawa. I.. Miyazaki,A. , SakaiX , Sasaki, D. , Shibata, K. , 
Shinagawa, A. , Yasunishi, A. , Yoshino,M. , Waterston, R. , Lander, E. S. , 
Rogers, J. , Birney.E. and Hayashizaki. Y. 

Analysis of the mouse transcr iptome based on functional annotation 

of 60.770 full-length cDNAs 

Nature 420 (6915). 563-573 (2002) 

22354683 

12466851 

3 

The FANT0M Consortium and the RIKEN Genome Exploration Research 
Group Phase I & 1 1 Team. 

Analysis of the mouse transcr iptome based on functional annotation 
of 60,770 full-length cDNAs 
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JOURNAL Nature 420, 563-573 (2002^P 
COMMENT PROVISIONAL REFSEQ : This record has not yet been subject to final 

NCBI review. The reference sequence was derived from AK075807. 1 , 
FEATURES Location/Qualifiers 
source 1..1155 

/organ ism="Mus musculus" 
/mol_type="mRNA" 
/strain="C57BL/6J" 
/db_xref="taxon:l0090" 
/chromosome="1 9" 
gene 1 . . 1 1 55 

/gene="1810073M23Rik" 

/note="synonyms : SMBP, Tm9sf3, 281 0031 D16Rik" 
/db_xr ef ="Locus I D : 107358 " 
/db xref="MGI: 1914262 " 
CDS 1..1155 

/gene="1810073M23Rik" 

/note="putativeunknown EST (GB | BG073434, evidence: BLASTN, 
99%. match=803); SM-1 1044 binding protein" 
/codon_start=1 

/product=*transmembrane protein 9 superfami ly member 3" 
/prote i n_i d=" NP 835359. 1 " 
/db_xref="G 1:30026032" 
/db_xr ef ="Locus I D : 107358 " 
/db_xref="MGI: 1914262 " 

/trans I at i on="MRPLPGAPGVAAAAALLLLLLPRARSDEHEHTYQDKEEVVLWMN 
TVGPYHNRQETYKYFSLPFCVGSKKS I SHYHETLGEALQGVELEFSGLD I KFKDDVMP 
GTYCE I DLDKEKRDAFVYA I KNHYWYQMY I DDLP I WG I VGEADENGEDYYLWTYKKLE 
I GFNGNR I VDVNLTSEGKVKLVPNTK I QMSYSVKWKKSDVKFEDRFDKYLDPSFFQHR 
I HWFS I FNSFMMV I FLVGLVSM I LWRTLRKDYARYSKEEEMDDMDRDLGDEYGWKQVH 
GDVFRPSSHPL I FSSL I GSGCQ I FAVSL I V 1 1 VAM I EDLYTERGSMLSTA I FVYAATS 
PVNGYFGGSLYARQGGRRW I KDVYRG I PYPSYGVWHC I LHQLYSHLLPCL" 
misc feature 160. .1071 

/gene="1810073M23Rik" 

/note="EMP70; Region: Endomembrane protein 70" 

/db„xref="CDD : pf am02990 " 
BASE COUNT 340 a 199 c 278 g 338 t 
ORIGIN 

1 atgaggccgc tgccgggcgc tcctggcgtg gcggcggccg ccgcgttgtt gttgctgctg 
61 ctgccccggg ctcggtctga cgagcacgaa cacacgtatc aagataaaga agaagttgtt 

121 ttatggatga atactgttgg gccctaccac aatcgccaag aaacatataa atacttttca 

181 cttccattct gtgtggggtc aaaaaaaagt atcagtcatt accatgaaac tttgggagaa 

241 gcacttcaag gagttgaatt agaatttagt ggcctggaca ttaaatttaa agatgatgtg 

301 atgccaggca cttactgtga aattgactta gataaagaga agagagatgc gtttgtgtat 

361 gctatcaaga atcactattg gtatcagatg tacatagatg acttaccaat atggggtatt 

421 gttggtgaag C2g3tgaB38 tggggaagat tactatcttt ggacctacaa aaaacttgaa 

481 ataggcttta atggaaatcg aattgttgat gttaatctaa ctagtgaagg aaaggtcaag 

541 ctggttccaa atactaaaat acagatgtca tattcggtaa aatggaaaaa atcggatgta 

601 aaatttgaag atcgattcga taaatatctt gatccatcct tttttcagca taggattcac 

661 tggttttcaa tttttaattc cttcatgatg gtgatcttct tagtgggatt agtttcaatg 

721 attttaatga gaactttaag gaaagattat gcccgataca gtaaagaaga agaaatggat 

781 gacatggaca gagacctagg agacgagtat ggctggaagc aggtgcatgg agatgtgttc 

841 agaccgtcaa gtcaccctct gatcttctcc tccctcattg gctctggatg tcagatattt 

901 gctgtgtctc tcattgttat tattgttgcc atgatagagg acttatatac agagaggggg 

961 tcaatgctca gcacagccat atttgtctat gctgctacat cgccagtgaa cgggtacttt 
1021 ggaggaagtc tgtatgctag acaaggagga aggagatgga taaaggatgt ttatcggggc 
1081 attccttatc ccagctatgg tgtgtggcac tgcattcttc atcaacttta tagccattta 
1141 ttaccatgcc tctag 

// 
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LOCUS AF1 16347 1804 bp mRNA linear PR I 16-0CT-2001 

DEFINITION Homo sapiens SM-11044 binding protein mRNA, partial cds. 

ACCESSION AF1 16347 

VERS I ON AF1 1 6347. 1 G 1 : 6650721 

KEYWORDS 

SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota: Metazoa; Chordata; Craniata: Vertebrata; Euteleostomi ; 
Mammalia; Eutheria: Primates; Catarrh ini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 1804) 
AUTHORS Sugasawa, T. , Lenzen, G. , Simon, S. , Hidaka, J. , Cahen, A. , 

Gu i 1 1 aume, J. L. , Camoin,L., Strosberg, A. D. and Nahmias.C. 
TITLE The iodocyanopindolol and SM-11044 binding protein belongs to the 

TM9SF multispanning membrane protein superfami ly 
JOURNAL Gene 273 (2). 227-237 (2001) 
MEDLINE 21479397 
PUBMED 11595169 
REFERENCE 2 (bases 1 to 1804) 
AUTHORS Sugasawa, T. , Lenzen. G. , Simon, S. , Hidaka, J. . Cahen, A. , 

Gui I laume, J. L. , Camoin, L , Nahmias.C. and Strosberg, A. D. 
TITLE Direct Submission 

JOURNAL Submitted (23-DEC-1998) Vetigen, 21 rue Sebastien Mercier, Paris 
75015, France 
FEATURES Location/Qualifiers 
source 1..1804 

/organ i smF*Homo sap i ens* 
/mol_type="mRNA" 
/db_xref="taxon:9606" 
/tissue_type="skeletal muscle* 
CDS <1. . 1737 

/nctc^ccntains nine potential membrane spanning domains; 
similar to yeast EMP70 protein precursor encoded by 
GenBank Accession Numbers X67316 and S25110" 
/codon_start=1 

/product="SM-11044 binding protein" 
/protei nj d=" AAF21983. 1 " 
/db_xref="G I: 6650722" 

/trans I ati on="AAAALWLLLLLLPRTRADEHEHTYQDKEEVVLWMNTVGPYHNRO 
ETYKYFSLPFCVGSKKS I SHYHETLGEALOGVELEFSGLD I KFKDDVMPATYCE I DLD 
KEKRDAFVYA I KNHYWYQMY I DDLP I WG I VGEADENGEDYYLWTYKKLE I GFNGNR I V 
DVNLTSEGKVKLVPNTK I QMSYSVKWKKSDVKFEDRFDKYLDPSFFQHR I HffFS I FNS 
FMMV I FLVGLVSM I LMRTLRKDYARYSKEEEMDDMDRDLGDEYGWKOVHGDVFRPSSH 
PL I FSSL I GSGCQ I FAVSL I V 1 1 VAM I EDLYTERGSMLSTA I FVYAATSPVNGYFGGS 
LYAROGGRRW I KQMF I GAFL I PAMVCGTAFF I NF I A I YYHASRA I PFGTMVAVCC I CF 
FV I LPLNLVGT I LGRNLSGQPNFPCRVNAVPRP I PEKKWFMEPAV I VCLGG I LPFGS I 
F I EMYF I FTSFWAYK I Y YVYGFMMLVLV I LC I VTVCVT I VCTYFLLNAEDYRWQWTSF 
LSAASTA I YVYMYSFYYYFFKTKMYGLFOTSFYFGYMAVFSTALG I MCGA I GYMGTSA 
FVRKIYTNVKID" 

BASE COUNT 509 a 301 c 403 g 591 t 

ORIGIN 

1 gcggccgccg cgctgtggct gctgctgctg ctgctgcccc ggacccgggc ggacgagcac 
61 gaacacacgt atcaagataa agaggaagtt gtcttatgga tgaatactgt tgggccctac 
121 cataatcgtc aagaaacata taagtacttt tcacttccat tctgtgtggg gtcaaaaaaa 
181 agtatcagtc attaccatga aactctggga gaagcacttc aaggggttga attggaattt 
241 agtggtctgg atattaaatt taaagatgat gtgatgccag ccacttactg tgaaattgat 
301 ttagataaag aaaagagaga tgcatttgta tatgccataa aaaatcatta ctggtaccag 
361 atgtacatag atgatttacc aatatggggt attgttggtg aggctgatga aaatggagaa 
421 gattactatc tttggaccta taaaaaactt gaaataggtt ttaatggaaa tcgaattgtt 
481 gatgttaatc taactagtga aggaaaggtg aaactggttc caaatactaa aatccagatg 
541 tcatattcag taaaatggaa aaagtcagat gtgaaatttg aagatcgatt tgacaaatat 
601 cttgatccgt ccttttttca acatcggatt cattggtttt caattttcaa ctccttcatg 
661 atggtgatct tcttggtggg cttagtttca atgattttaa tgagaacatt aagaaaagat 
721 tatgctcggt acagtaaaga ggaagaaatg gatgatatgg atagagacct aggagatgaa 
781 tatggatgga aacaggtgca tggagatgta tttagaccat caagtcaccc actgatattt 
841 tcctctctga ttggttctgg atgtcagata tttgctgtgt ctctcatcgfc. tattattgtt 
901 gcaatgatag aagatttata tactgagagg ggatcaatgc tcagtacagc "catatttgtc 
961 tatgctgcta cgtctccagt gaatggttat tttggaggaa gtctgtatgc tagacaagga 
1021 ggaaggagat ggataaagca gatgtttatt ggggcattcc ttatcccagc tatggtgtgt 
1081 ggcactgcct tcttcatcaa tttcatagcc atttattacc atgcttcaag agccattcct 
1141 tttggaacaa tggtggccgt ttgttgcatc tgtttttttg ttattcttcc tctaaatctt 
1201 gttggtacaa tacttggccg aaatctgtca ggtcagccca actttccttg tcgtgtcaat 
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gctgtgcctc gtcctatacc ggagal 



tggttcatgg agcctgcggt tattgtttgc 



1321 ctgggtggaa ttttaccttt tggttcaatc tttattgaaa tgtatttcat cttcacgtct 

1381 ttctgggcat ataagatcta ttatgtctat ggcttcatga tgctggtgct ggttatcctg 

1441 tgcattgtga ctgtctgtgt gactattgtg tgcacatatt ttctactaaa tgcagaagat 

1501 taccggtggc aatggacaag ttttctctct gctgcatcaa ctgcaatcta tgtttacatg 

1561 tattcctttt actactattt tttcaaaaca aagatgtatg gcttatttca aacatcattt 

1621 tactttggat atatggcggt atttagcaca gccttgggga taatgtgtgg agcgattggt 

1681 tacatgggaa caagtgcctt tgtccgaaaa atctatacta atgtgaaaat tgactagaga 

1741 cccaagaaaa cctggaactt tggatcaatt tctttttcat aggggtggaa cttgcacagc 

1801 aaaa 



// 
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□ 1: NM.067729. Caenorhabditis el...[gi:1 7543267] 



Links 



LOCUS 4C515 1743 bp mRNA linear INV 22-N0V-2002 

DEFINITION Caenorhabditis elegans transmembrane protein TM9SF3 (66.6 kD), 
mRNA. 

ACCESSION NM.067729 

VERS ION NMJJ67729. 1 G 1 : 1 7543267 

KEYWORDS 

SOURCE Caenorhabditis elegans 
ORGANISM Caenorhabditis elegans 

Eukaryota; Metazoa: Nematoda: Chromadorea; Rhabditida: 
Rhabd ttoidea; Rhabditidae; Peloderinae; Caenorhabditis. 
COMMENT VALIDATED REFSEQ : This record has undergone preliminary review of 
the sequence, but has not yet been subject to final NCBI review. 
The reference sequence was derived from WormBase CDS: Y41D4A. 4. 
Fully supported by cDNA from the Worm Transcriptome Project. 
COMPLETENESS : full length. 
FEATURES Location/Qua! i f i ers 

source 1..1743 

/organism="Caenorhabditis elegans" 
/mol_type="mRNA" 
/db_xref="taxon:6239" 
/chromosome="IV" 

/map="IV;-15.38 cM (interpolated genetic position)" 
/map="IV; covering 5823 bp, from base 1745879 to 1751703 
on genome release WS85" 

/clone="Primers to amplify the CDS (Stop included): 
ATGCGGGTACTGATAAGCC (T=57. 6) . 
TTAATCGATTTTCACAGTCTGATAGATTT (T=56. 4) " 
/c lone_ I ib=" [Predicted eel lu I ar localisation] PSORT II 
analysis, (K. Nakai) trained on yeast data predicts that 
the subcellular location of this protein is most likely in 
the endoplasmic reticulum (66%) or in the plasma membrane 
\ ■»"-- /*/ • 

gene 1 . . 1 743 

/gene="4C515" 
/tocus_tag="4C515" 

/note="synonyms: Y41D4A.4, CELK01378" 
CDS 1..1743 

/gene="4C515" 
/locus_tag="4C515" 

/function="Transporter G0:0005215 [Pfam/lnterpro/GO]" 
/function="Transport G0:0006810 [Pfam/1nterpro/G0]" 
/function=" Integra I membrane protein GO: 001 6021 
[Pfam/lnterpro/GO]" 

/note="|| [Common ancestor, from TaxBlast with threshold 
10-3] Eukaryota || [Closest homo logs] Dictyostel ium 
discoideum: emb | CAC47950. 1 1 (score=271); Arabidopsis 
tha liana: ref |NP 563881. 1 1 (528): Saccharomyces 
cerevisiae: ref |NP 010392. 1 1 (211); Schi zosaccharomyces 
pombe: ref|NP 596464.1] (177): Caenorhabditis elegans: 
ref|NP 500130.1J (1028): Anopheles gambiae str PEST: 
gb|EAA09712. 1 | (724): Drosophi la melanogaster : ref|NP 
647979.11 (728): Mus musculus: ref|NP 579930.1] (760); 
Homo sapiens: ref |XP 050993. 1 1 (761)" 
/codon_start=1 

/product="Transmembrane protein TM9SF3 (66.6 kD)" 

/protei n id=" NP 500130.1 " 

/db_xref ="G I : 1 7543268" 

/db_xr ef ="AceV i ew/WormGenes : 4C51 5 " 

/db.xref ="CDD : pfam02990" 

/db.xref ="Locus I D : 176983 " 

/db.xref ="NextDB : CELK01378 " 

/db xref~"WormBase : Y41 D4A. 4 " 

/trans I ati on="MRVLISLLLFTSLLVEADEHDHLYEVDEEVVLWMNTVGPYSNRO 
ETYTYFSLPFCK6EKKE I GHYHETMGESLLGVELEFSGLD I KFRTNTKKTWCEKKLT 
ETDYKTLLYA I QNSYY YQMYLDDMP 1 WGMVGE I DNTVNPPAYKLYTHKRLD I GYNDKQ 
VVDVNLTTDGRVD I RPGAELTFTYEVQWSKSEVDFTKRFDKYLDPSFFQHR I HWFS I F 
NSFMMWFLVGLVWM I LVRTLRKDYARYQKEDSLDDLDADLGDEYGWKOVHGDVFRPP 
SMPLLFSSC I GAGYHVFTVAV I TT I LAI VGEFYTERGSLLSAA I FVYAACSPVNGYAG 
GSMYARFGGRHW I KQMLLGAFLLPSMVCGVAFL I NF I A I YYHASRA I PFT I MLAVS A I 
CLFV I LPLTLVGTV I GRNMAGSADYPCRVNAVPRP I PDKKWFVOPWL I TLAGGVLPFG 
S I F I EMYF I FTSFWAYK I YYVYGFMLLVT 1 1 LS I VTVCVT I VCSYFLLNAEDYRWRWT 
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FKTKMYGLFQTVFYFGYMG I FAAALGLMTGT I GYVGT 



SFACGASTAFYVYMYS 
AKFVRKlYOTVKtD" 
misc feature 1..51 

/gene="4C515" 
/locus_tag="4C515" 

/note="Region: [PSORT] N terminal signal domain" 
misc feature 133. .1614 

/gene="4C515~ 
/locus_tag="4C515~ 

/note="Region: [Pfam/ Inter Pro description] endomembrane 
protein 70: the transmembrane 9 superfami ly protein 
(TM9SF) may function as a channel or small molecule 
transporter. Proteins in this group are endosomal integral 
membrane proteins. HMMER score 820.10 Evalue=5. 1e-244" 
/db.xref ="CDD : pfam02990 " 
misc feature 658. . 708 

/gene="4C515" 
/locus_tag="4C515~ 

/note="Region: [PSORT] transmembrane domain: 
SFMMVVFLVGLVWM I LV" 
misc feature 874. . 924 

/gene="4C515~ 
/locus_tag="4C515" 

/note="Region: [PSORT] transmembrane domain: 
VFTVAV I TT I LA I VGEF" 
misc feature 937. . 987 

/gene="4C515' r 
/locus_tag="4C515" 

/note="Region: [PSORT] transmembrane domain: 
GSLLSAA I FVYAACSPV 
misc feature 1054. .1104 

/gene="4C515" 
/locus_tag="4C515" 

/note="Region: [PSORT] transmembrane domain: 
LGAFLLPSMVCGVAFL I " 
misc_feature 1 159. . 1 209 

/gene="4C515* 
/locus_tag="4C515" 

/note="Region: [PSORT] transmembrane domain: 
LAVSA I CLFV I LPLTLV" 
misc feature 1 31 2. . 1 362 

/gene="4C515" r 
/locus„tag="4C515" 

/note="Region: [PSORT] transmembrane domain: 
WL I TLAGGVLPFGS I F I * 
misc featur e 1 426. . 1 476 

/gene^CSIS" 
/locus_tag="4C515" 

/note="Region: [PSORT] transmembrane domain: 
LLVTIILSIVTVCVTIV 
misc_feature 1726. .1737 

/gene="4C515" 
/locus_tag=' r 4C515* 

/note="Region: [PSORT] ER membrane domain: TVKT 
BASE COUNT 442 a 428 c 404 g 469 t 
ORIGIN 

1 atgcgggtac tgataagcct gctactattc acctcgctgc tcgtcgaagc cgacgaacac 
61 gaccatttgt acgaggtcga cgaggaggtt gtattatgga tgaacacggt gggaccctac 
121 tcgaatcgcc aggaaaccta cacctacttc tccctgccat tttgtaaagg agagaagaag 
181 gaaattggcc attatcacga aactatggga gaatcactgc tcggcgtgga gctagagttc 
241 tccggattgg atataaaatt cagaacaaac accaagaaaa ccgtggtttg cgagaagaag 
301 ctcaccgaaa ccgactataa aacgctgctc tacgctatcc aaaacagcta ttactatcaa 
361 atgtacctcg acgacatgcc gatttgggga atggttggag aaatcgataa tactgtgaat 
421 ccgccggcct acaagcttta tacgcataag aggctagata ttggctataa cgataagcag 
481 gtggtcgacg tgaatttgac aacagacgga agagttgata ttcgtcccgg cgctgagctc 
541 acgttcacct atgaggtgca atggagcaag tcggaagtgg attttaccaa gagattcgat 
601 aaatatctcg atccaagctt cttccagcac agaatccact ggttctcgat ttttaacagt 
661 ttcatgatgg tcgtattcct cgtcggattg gtatggatga ttctagtccg tactctacga 
721 aaagactacg cccgatacca aaaagaggat agtctcgacg atttggacgc cgatctcgga 
781 gatgagtacg gttggaaaca ggttcacgga gacgtcttcc gtccaccatc aatgccgttg 
841 ctcttctcat cgtgtattgg agccggttat catgtgttca ctgtcgcggt tattacgacg 
901 attttggcga ttgttggaga attttataca gaacgcggct ccctactctc cgccgcaatc 
961 ttcgtctacg ccgcttgctc gccagtcaac ggatatgccg gcggatcgat gtacgccaga 
1021 ttcggaggtc gtcactggat caagcaaatg ctcctcggag cattccttct gccaagtatg 
1081 gtgtgtggtg tcgcattcct tatcaacttt atcgcgatct actatcacgc gtcacgcgcc 
1141 attccattca caattatgct cgccgtctcg gcgatctgcc tctttgtgat ccttccgctg 
1201 actcttgtcg gaactgtgat cggaagaaat atggctggaa gtgcagacta tccgtgtaga 
1261 gtgaatgccg tgccgagacc cattcctgat aagaaatggt tcgtccaacc gtggctcatc 
1321 acccttgccg gaggtgttct cccattcgga tccatcttca tcgaaatgta ctttatcttc 
1381 acctcattct gggcgtacaa aatctattat gtgtacggat tcatgctttt ggtcacaata 
1441 atcctgtcaa ttgtcaccgt atgcgtcaca attgtctgct cctacttcct actcaacgcc 
1501 gaggattata gatggagatg gacgagcttc gcgtgtggag catctactgc attctatgtg 
1561 tacatgtatt ccatgtatta tttcttcttc aagacaaaaa tgtacggact tttccaaacg 
1621 gtcttctact ttggatacat gggaatcttc gccgccgctc tcggactcat gaccggcacc 
1681 atcggatacg tcggaaccgc caaattcgtc cgcaaaatct atcagactgt gaaaatcgat 
1741 taa 
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LOCUS AF269150 3592 bp mRNA linear PR I 09-AUG-2000 

DEFINITION Homo sapiens transmembrane protein TM9SF3 (TM9SF3) rnRNA; complete 
cds. 

ACCESSION AF269150 

VERSION AF269150. 1 Gl:9755050 

KEYWORDS 

SOURCE Homo sapiens (human) 
ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata: Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria: Primates; Catarrhini: Hominidae; Homo. 
REFERENCE 1 (bases 1 to 3592) 
AUTHORS Warner, S. J. and Lomax. M. I . 

TITLE Evolution of the TM9 Super Family of Membrane Spanning Proteins 
JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 3592) 
AUTHORS Warner. S. J. and Lomax, M. I . 
TITLE Direct Submission 

JOURNAL Submitted (10-MAY-2000) Kresge Hearing Research Institute, 

University of Michigan, 9301 E MSRB III, 1150 West Medical Center 
Drive, Ann Arbor, Ml 48109-0648, USA 
FEATURES Location/Qual i f iers 

source 1. .3592 

/organism="Homo sapiens* 
/mo I _type="mRNA~ 
/db_xr ef ="taxon : 9606* 
/chromosome="1 0" 
/map="10q23-q24" 
gene 1, . 3592 

/gene="TM9SF3" 
CDS 218.. 1987 

/gene="TM9SF3" 

/note="transmembrane protein 9 super f ami iy member 3" 
/codon_start=1 

/product="transmembrane protein TM9SF3" 
/protei n_i d=" AAF98159. V 
/db_xref="G I: 9755051" 

/trans I at i on="MRPLPGALGVAAAAALWLLLLLLPRTRADEHEHTYQDKEEVVLW 
MNTVGPYHNRQETYKYFSLPFCVGSKKS I SHYHETLGEALQGVELEFSGLD I KFKDDV 
MPATYCE I DLDKEKRDAFVYA I KNHYWYQMY I DDLP I WG I VGEADENGEDYYLWTYKK 
LE I GFNGNR I VDVNLTSEGKVKLVPNTK I 0 I SYSVKWKKSDVKFEDRFDKYLDPSFFO 
HR I HWFS I FNSFMMV I FLVGLVSM I LMRTLRKDYARYSKEEEMDDMDRDLGDEYGWKQ 
VHGDVFRPSSHPL I FSSL I GSGCQ I FAVSL I V 1 1 VAM I EDLYTERGSMLSTA I FVYAA 
TSPVNGYFGGSLY ARQGGRRW I KQMF I GAFL I PAMVCGTAFF I NF I A I YYHASRA I PF 
GTMVAVCC I CFFV I LPLNLVGT I LGRNLSGQPNFPCRVNAVPRP I PEKKWFMEPAV I V 
CLGG I LPFGS I F I EMYF I FTSFWAYK I YYVYGFMMLVLV I LC I VTVCVT I VCTYFLLN 
AEDYRWQWTSFLSAASTA I YVYMYSFYYYFFKTKMYGLFQTSFYFGYMAVFSTALG I M 
CGA I GYMGTSAFVRK I YTNVK I D" 

BASE COUNT 1026 a 604 c 815 g 1147 t 

ORIGIN 

1 gaggaagagg ctgaggaggc gcggggggcg ggggaggctc aggagcgggc ggtgacggcg 
61 acggcggcgg cagaggaggc agcggctggg ccgggccccg tgcgtctgtc cgcgccccgt 
121 ggatgcgaat cggccgcggc ggaggcggcg gcggcggagg aggcggcggc gggaggagga 
181 gtcggtgagc cggctccggg ccggaggggc gcggaggatg aggccgctgc ctggcgctct 
241 tggcgtggcg gcggccgccg cgctgtggct gctgctgctg ctgctgcccc ggacccgggc 
301 ggacgagcac gaacacacgt atcaagataa agaggaagtt gtcttatgga tgaatactgt 
361 tgggccctac cataatcgtc aagaaacata taagtacttt tcacttccat tctgtgtggg 
421 gtcaaaaaaa agtatcagtc attaccatga aactctggga gaagcacttc aaggggttga 
481 attggaattt agtggtctgg atattaaatt taaagatgat gtgatgccag ccacttactg 
541 tgaaattgat ttagataaag aaaagagaga tgcatttgta tatgccataa aaaatcatta 
601 ctggtaccag atgtacatag atgatttacc aatatggggt attgttggtg aggctgatga 
661 aaatggagaa gattactatc tttggaccta taaaaaactt gaaataggtt ttaatggaaa 
721 tcgaattgtt gatgttaatc taactagtga aggaaaggtg aaactggttc caaatactaa 
781 aatccagatt tcatattcag taaaatggaa aaagtcagat gtgaaatttg aagatcgatt 
841 tgacaaatat cttgatccgt ccttttttca acatcggatt cattggtttt caattttcaa 
901 ctccttcatg atggtgatct tcttggtggg cttagtttca atgattttaa tgagaacatt 
961 aagaaaagat tatgctcggt acagtaaaga ggaagaaatg gatgatatgg atagagacct 
1021 aggagatgaa tatggatgga aacaggtgca tggagatgta tttagaccat caagtcaccc 
1081 actgatattt tcctctctga ttggttctgg atgtcagata tttgctgtgt ctctcatcgt 
1141 tattattgtt gcaatgatag aagatttata tactgagagg ggatcaatgc tcagtacagc 
1201 catatttgtc tatgctgcta cgtctccagt gaatggttat tttggaggaa gtctgtatgc 
1261 tagacaagga ggaaggagat ggataaagca gatgtttatt ggggcattcc ttatcccagc 
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1321 tatggtgtgt ggcactgcct tctt<^^^ tttcatagcc atttattacc atgcttcaag 

1381 agccattcct tttggaacaa tggtggocgt ttgttgcatc tgtttttttg ttattcttcc 

1441 tctaaatctt gttggtacaa tacttggccg aaatctgtca ggtcagccca actttccttg 

1501 tcgtgtcaat gctgtgcctc gtcctatacc ggagaaaaaa tggttcatgg agcctgcggt 

1561 tattgtttgc ctgggtggaa ttttaccttt tggttcaatc tttattgaaa tgtatttcat 

1621 cttcacgtct ttctgggcat ataagatcta ttatgtctat ggcttcatga tgctggtgct 

1681 ggttatcctg tgcattgtga ctgtctgtgt gactattgtg tgcacatatt ttctactaaa 

1741 tgcagaagat taccggtggc aatggacaag ttttctctct gctgcatcaa ctgcaatcta 

1801 tgtttacatg tattcctttt actactattt tttcaaaaca aagatgtatg gcttatttca 

1861 aacatcattt tactttggat atatggcggt atttagcaca gccttgggga taatgtgtgg 

1921 agcgattggt tacatgggaa caagtgcctt tgtccgaaaa atctatacta atgtgaaaat 

1981 tgactagaga cccaagaaaa cctggaactt tggatcaatt tctttttcat aggggtggaa 

2041 cttgcacagc aaaaacaaac aaacgcaaga agagatttgg gctttaacac actgggtact 

2101 ttgtgggtct ctctttcgtc ggtggcttaa agtaacatct atttccattg atcctaggtt 

2161 cttcctgact gctttctcca actgttcaca gcaaatgctt ggattttatg cagtaggcat 

2221 tactacagta catggctaat cttcccaaaa actagctcat taaagatgaa atagaccagc 

2281 tctcttcagt gaagaggaca aatagtttat ttaaagcatt tgttccaata aaataaatag 

2341 agggaaactt ggatgctaaa attacatgaa taggaatctt cctggcactt agtgtttcta 

2401 tgttattgaa aaatgatgtt ccagaaagat tacttttttc ctcttatttt tactgccatt 

2461 gtcgacctat tgtgggacat ttttatatat tgaatctggg ttcttttttg actttttttt 

2521 ttcccaatcc aacagcatcc ttttttttaa aagagagaat tagaaaatat taaatcctgc 

2581 atgtaatata tctgctgtca tcttagttgg accaacttcc catttattta tcttaaaact 

2641 atacagttac atcttaattc catccaaaga agatacagtt tgaagacaga agtgtactct 

2701 ctacaatgca atttactgta cagttagaaa gcaaagtgtt aaatggagaa gatacttgtt 

2761 tttattaaac attttgagat ttagataaac tacattttaa ctgaatgtct aaagtgatta 

2821 tcttttttcc ccccaagtta gtcttaaatc ttttgggttt gaatgaaggt tttacataag 

2881 aaattattaa aaacaagggg ggtgggtaat aaatgtatat aacattaaat aatgtaacgt 

2941 aggtgtagat tcccaaatgc atttggatgt acagatcgac tacagagtac ttttttctta 

3001 tgatgattgg tgtagaaatg tgtgatttgg gtgggctttt acatcttgcc taccattgca 

3061 tgaaacattg gggtttcttc aaaatgtgtg tgtcatactt cttttgggag gggggttgtt 

3121 ttcttctgtt tattttctga gactcctaca ggagccaaat ttgtaattta gagacactta 

3181 attttgttaa tcctgtctgg gacacttaag taacatctaa agcattattg ctttagaatg 

3241 ttcaaataaa atttcctgac caaattgttt tgtggaaata gatgtgtttg caatttgaag 

3301 atatctttct gtccagaagg caaaattacc gaatgccatt tttaaaagta tgctataaac 

3361 tatgctactc tcatacaggg gacccgtatt ttaaaatctc cagacttgct tacatctaga 

3421 ttatccagca caatcataaa gtgaatgaca aaccctttga atgaaattgt ggcacaaaat 

3481 ctgttcaggt tggtgtaccg tgtaaagtgg ggatggggta aaagtggtta acgtactgtt 

3541 ggatcaacaa ataaaggtta cagttttgta agagaaaaaa aaaaaaaaaa aa 
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D 1: AF269151. Mus musculus tran...[gi:9755052] 



LOCUS AF269151 3143 bp mRNA linear ROD 09-AUG-2000 

DEFINITION Mus musculus transmembrane protein TM9SF3 (Tm9sf3) mRNA, complete 
cds. 

ACCESSION AF269151 

VERSION AF269151.1 Gl:9755052 

KEYWORDS 

SOURCE Mus musculus (house mouse) 
ORGANISM Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodentia: Sciurognathi ; Muridae; Murinae; Mus. 
REFERENCE 1 (bases 1 to 3143) 
AUTHORS Warner. S. J. and Lomax, M. I. 

TITLE Evolution of the TM9 Super Family of Membrane Spanning Proteins 
JOURNAL Unpublished 
REFERENCE 2 (bases 1 to 3143) 
AUTHORS Warner, S. J. and Lomax. M. I. 
TITLE Di rect Submission 

JOURNAL Submitted (10-MAY-2000) Kresge Hearing Research Institute, 

University of Michigan. 9301E MSRB III, 1150 West Medical Center 
Drive. Ann Arbor, Ml 48109-0648, USA 
FEATURES Location/Qua I if iers 

source 1 . .3143 

/organism= w Mus musculus* 
/mol_type="mRNA" 
/db_xref="taxon: 10090" 
gene 1..3143 

/gene="Tm9sf3" 
CDS 83. .1846 

/gene=~Tm9sf3" 

/note=~transmembrane protein 9 superfami ly member 3" 
/codon_start=1 

/product 1 * transmembrane pi O lc i i~l TmQSFS 

/protein id=" AAF98160. 1 " 
/db_xref="G I: 9755053" 

/trans I at i on="MRPLPGAPGVAAAAALLLLLLPRARSDEHEHTYQDKEEVVLWMN 
TVGPYHNRQETYKYFSLPFCVGSKKS I SHYHETLGEALOGVELEFSGLD I KFKDDVMP 
GTYCE I DLDKEKRDAFVYA I KNHYWYQMY I DDLP I WG I VGEADENGEDYYLWTYKKLE 
I GFNGNR I VDVNLTSEGKVKLVPNTK I QMSYSVKWKKSDVKFEDRFDKYLDPSFFOHR 
I HWFS I FNSFMMV 1 FLVGLVSM I LMRTLRKDYARYSKEEEMDDMDRDLGDEYGWKQVH 
GDVFRPSSHPL I FSSL I GSGCQ I FAVSL I VI 1VAMI EDLYTERGSMLSTA I FVYAATS 
PVNGY FGGSLYARQGGRRW I KQMF I GAFL I PAMVCGTAFF I NF I A I YYHASRA I PFGT 
MVAVCC I CFFV I LPLNLVGT I LGRNLSGQPNFPCRVNAVPRP I PEKKWFMEP AV I VCL 
GG I LPFGS I F I EMYF I FTSFWAYK I YYVYGFMMLVLV I LC I VTVCVT I VCTYFLLNAE 
DYRWQWTSFLSAASTA I YVYMYSFYYYFFKTKMYGLFOTSFYFGYMAVFSTALG I MCG 
A I GYMGTSAFVRK I YTNVK I D" 

BASE COUNT 868 a 564 c 699 g 1012 t 

ORIGIN 

1 tccgcgcccg gtggatgtga ateggeggeg geggeggagg aggagteggt gggccggcgc 
61 ccggcctgtg ggagcgcgga ggatgaggee gctgccgggc gctcctggcg tggeggegge 
121 cgccgcgttg ttgttgctgc tgctgccccg ggcteggtet gacgagcacg aacacaegta 
181 tcaagataaa gaagaagttg ttttatggat gaatactgtt gggccctacc acaatcgcca 
241 agaaacatat aaatactttt cacttccatt ctgtgtgggg tcaaaaaaaa gtatcagtca 
301 ttaccatgaa actttgggag aagcacttca aggagttgaa ttagaattta gtggcctgga 
361 cattaaattt aaagatgatg tgatgecagg cacttactgt gaaattgact tagataaaga 
421 gaagagagat gcgtttgtgt atgctatcaa gaatcactat tggtatcaga tgtacataga 
481 tgacttacca atatggggta ttgttggtga agcagatgaa aatggggaag attactatct 
541 ttggacctac aaaaaacttg aaataggctt taatggaaat cgaattgttg atgttaatct 
601 aactagtgaa ggaaaggtca agctggttcc aaatactaaa atacagatgt catatteggt 
661 aaaatggaaa aaatcggatg taaaatttga agatcgattc gataaatatc ttgatccatc 
721 cttttttcag cataggattc actggttttc aatttttaat tccttcatga tggtgatctt 
781 cttagtggga ttagtttcaa tgattttaat gagaacttta aggaaagatt atgeccgata 
841 cagtaaagaa gaagaaatgg atgacatgga cagagaccta ggagacgagt atggctggaa 
901 geaggtgeat ggagatgtgt tcagaccgtc aagtcaccct ctgatcttct cctccctcat 
961 tggctctgga tgtcagatat ttgctgtgtc tctcattgtt attattgttg ccatgataga 
1021 ggacttatat acagagaggg ggtcaatget cagcacagcc atatttgtct atgetgetae 
1081 ategecagtg aacgggtact ttggaggaag tetgtatget agacaaggag gaaggagatg 
1141 gataaagcag atgtttatcg gggcattcct tatcccagct atggtgtgtg geactgeatt 
1201 cttcatcaac tttatageca tttattacca tgectctaga gccattcctt ttggaacaat 
1261 ggtggccgtt tgttgcatct gtttctttgt tatccttcct ctaaatctcg ttggtacaat 
1321 actgggtcga aatctgtcgg gtcagcccaa cttcccttgt cgtgtcaatg ccgtgcctcg 
1381 tcccatcccg gagaaaaaat ggtttatgga gectgeagtt attgtttgcc tgggaggaat 
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1441 tttacctttt ggatcaatct ttattfl^c gtacttcatc ttcacgtcct tctgggcata 

1501 taagatctac tatgtctatg gcttcafgat gctggtgctg gtcatcctgt gcattgtgac 

1561 tgtctgtgtg accattgtct gcacatactt cctgctaaac gcagaggatt acaggtggca 

1621 gtggacgagt ttcctgtctg cggcgtcgac cgcgatctac gtttacatgt actcctttta 

1681 ctactacttc ttcaaaacca agatgtatgg cttatttcaa acatcatttt actttggcta 

1741 catggcggtg tttagcacag ccttggggat aatgtgtgga gcaatcggtt acatgggaac 

1801 aagtgccttt gttcgaaaaa tctatactaa tgtgaaaatt gactagatat ctgagacaac 

1861 atggaacttc ggatcagttt ctctttcatg agggtggaac ttgcacagaa aagaagaata 

1921 atcgcaagaa gagatttggg cttaacactg ggtactttgt gggtctcttt tgtggatggc 

1981 ttcaagtaac atctatttcc attgatccta ggttcttact gactgctttc tccagctgtt 

2041 cacagcagat gcttggattt tatgcagtag gctttactac agtacatggc taatcttccc 

2101 caaactagct cattaaagat gaaatagacc agctctcttc agtgaagagg acaaatagtt 

2161 tatttaaagc atttgttcca ataaaataac tagagggaaa cttggatgct aaaattccat 

2221 aagtagaaaa atcttcctgg cacttggtgt ttctatgtta ttgaaaaatg atgttccaga 

2281 aagattactt tttcttttat ttttactgcc attgttgatg tgttgtggga catttttata 

2341 tattgaacct gggttctttt ttgatctgtt ttggtcgtcc ctccattcaa ctgcatcttt 

2401 tttttttttt aaaaacaaaa ttaaaagcta ttcaattctg catggaatat atctgctgtc 

2461 atcttaatct tagtttggat caacctcatt tatttatctt aaaattaacc agctttctca 

2521 gttccatcca aagaagatac agtatgaaag tagaggtgca ctctctataa tacaatctac 

2581 tgtacagtta gcaagcgaag tgttaaatgg agaagatgct tgtttttatt aaacatttgg 

2641 agatttagat aatttacatt ttaactgaat gtctaaagtg attatcttcc cctccctccc 

2701 ccaagttaat cttccatctt tgtttttggg tttcaatgaa ggctttacat aagaaattat 

2761 taaacaaaac aaggggggca ataaatgtat ataacattaa taatgtcgcg taggtgtagg 

2821 ttcctggatg catttggatg tacagactga ctgggggact gtattgcgat gatgattggt 

2881 gtagaaacgt atgggtctgg gtggcctttt acatcttgcc taccattgca tgaaacattg 

2941 gggtttcttc aaaatgtgtg tgtcatactt cttttgggag ggggttgttt tcttctttta 

3001 tgaaactcct actggagctc aatttgtaat ttagaaacat ttaattttgt taatcctgtc 

3061 tgggacattt aggtaacatc taaagcatta ttgctttaga atgttcaaat aaaatttcct 

3121 gaccaaaaaa aaaaaaaaaa aaa 
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□ 1: AY071401. Drosophila melano...[gi:1 7945956] 



Links 



2662 bp mRNA linear INV 20-DEC-2001 



LOCUS AY071401 

DEFINITION Drosophi la melanogaster RE48767 full length cDNA. 
ACCESSION AY071401 
VERSION AY071401. 1 Gl : 17945956 
KEYWORDS FLLCDNA. 

SOURCE Drosophi la melanogaster (fruit fly) 
ORGAN I SM Drosophila melanogaster 

Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta: Pterygota; 
Neoptera; Endopterygota ; Diptera; Brachycera: Muscomorpha; 
Ephydro i dea ; Drosoph i I i dae ; Drosoph i I a. 
REFERENCE 1 (bases 1 to 2662) 
AUTHORS Stap I eton, M. , Brokstei n, P. , Hong, L. , Agbayani.A. , Carlson. J., 

Champe, M. , Chavez, C, Dorsett, V. , Dresnek. D. , Farfan, D. . Frise, E. , 
George, R. , Gonzalez, M. , Guarin, H. , Kronmi I ler, B. , Li. P.. Liao, G. . 
Miranda. A., Munga 1 1 . C. J. , Nunoo. J. , Pacleb, J. , Paragas, V. . Park. S. , 
Pate I, S. , Phouanenavong, S. , Wan. K. , Yu.C. . Lewis. S. E. . Rubin, G. M. 
and Celniker. S. 
TITLE Direct Submission 

JOURNAL Submitted (18-DEC-2001) Berkeley Drosophila Genome Project, 
Lawrence Berkeley National Laboratory, One Cyclotron Road, 
Berkeley, CA 94720. USA 
COMMENT Sequence submitted by: 

Berkeley Drosophila Genome Project 
Lawrence Berkeley National Laboratory 
Berkeley. CA 94720 

This clone was sequenced as part of a high-throughput process to 
sequence clones from Drosophila Gene Collection 1 (Rubin et al., 
Science 2000). The sequence has been subjected to integrity checks 
for sequence accuracy, presence of a polyA tail and contiguity 
within 100 kb in the genome; Thus we believe the sequence to 
reflect accurately this particular cDNA clone. However, there are 
artifacts associated with the generation of cDNA clones that may 
have not been detected in our initial analyses such as internal 
priming, priming from contaminating genomic DNA, retained introns 
due to reverse transcription of unspliced precursor RNAs, and 
reverse transcriptase errors that result in single base changes. 
For further information about this sequence, including its location 
and relationship to other sequences, please visit our Web site 
( http : //f ru i tf I v. berke I ey. edu ) or send emai I to 
cdnaff ru i tf I y. berke I ey. edu. 
FEATURES Location/Qual if iers 

source 1. .2662 

/organ ismF^Drosophi la melanogaster" 

/mol_type=*mRNA* 

/strain="y; cn bw sp" 

/db xref="taxon:7227~ 

/map="64E3-64E3" 
gene 1..2662 

/gene="CG10590" 

/note-"a I i gnment with genomic scaffold AE003565" 
/db xref ="FLYBASE : FBgn0035622" 
CDS 178.. 1956 

/gene="CG10590~ 
/note="Longest 0RF" 
/codon_start=1 
/product="RE48767p" 
/prote i n_i d=" AAL49023. V 
/db_xref="G I: 17945957" 
/db.xref ="FLYBASE : FBgn0035622" 

/trans I at i on="MGSSATQTTVPFRA I CLCLL I ASCYVSLSOADEHNHKYNDREEV 
VLWMNTVGPYHNRQETYAYFSLPFCSGQKSS I SHYHETLSEALQGVELEFSGYEMEFK 
SDAPKSV I CMVTLQEESAKAFTYAVKNEYWYQMY I DGLP I WGKVGERDERDGKYY I FT 
HKKFD I GYNGQQ I VD I TLTTEGREELKPGSH I NFSYEVNWKPSKVEFKNRFDKYLDPN 
FFQHR I HWFS I FNSFMMV I FLVGLVSM I LMRTLRKDYARYSKDEE I DDMERDLGDEYG 
WKQV HG DV FRS PPN TLLFS ALVG AG YQL I SVVFCV I MFAI VGELYTERGSMLSTA I FV 
YAATSP I NGYFGGSLYARLGGRMW I RQMLVSAFTVPVAVCGTAFL I NF I A I GYHASRA 
I PFGTMVAVTC I CLFV I LPLTLVGTVVGRNLDGQPDFPCRVNAVPRP I PEKKWYMEPL 
1 1 VLLGGVLPFGS I F I EMYF I FTSFWAYK I YYVYGFMLLVFS 1 LTVVTVCVT I VCTYF 
LLNAEDYRWQWTSFMAAGSTS I YVYAYSFYYFFFKTKMFGLFOT AFYFGYMALFSGAL 
Gl I CGTVGYVGTNLFVRK I YSNVK I D" 
BASE COUNT 672 a 651 c 613 g 726 t 
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ORIGIN 



atcgatgttt 
agtgtttgcc 
gcagagcgag 
ggcagcagcg 
gccagctgct 
gaggaggtgg 
gcgtacttct 
acgctgagcg 
aagagcgacg 
gcattcacct 
atttggggaa 
aagaagttcg 
ggccgcgagg 
cccagcaagg 
cacaggatcc 
ctggtgtcca 
gaggaaatcg 
ggcgatgtct 
taccaactga 
acggaacgcg 
aatggatact 
atgctggtgt 
ttcattgcca 
acttgcatct 
aatctggacg 
gaaaagaagt 
ggatccatct 
tacgtctacg 
accatcgtgt 
ttcatggctg 
tttaaaacca 
ttcagcggcg 
gtgcgcaaaa 
cccagatctg 
tctgtttcac 
ttctcatgtt 
tcagtttaac 
taagactaga 
tgttgcattt 
tttgtaattt 
cttgcttaca 
tatgttacgt 
ttatcaatct 
tgtacgatgc 
tataaaacca 



ctccacctct 
acaaaactgt 
gtcaccagga 
ccacgcaaac 
acgtgtcgct 
tactgtggat 
ctctcccctt 
aggcgctgca 
cccccaaatc 
atgccgtgaa 
aagtcggtga 
acatcggcta 
aactcaagcc 
tggagttcaa 
actggttcag 
tgattctgat 
acgacatgga 
tccgttctcc 
tttcggttgt 
gctccatgct 
ttggaggatc 
ccgcttttac 
ttggatatca 
gcctgtttgt 
gccaaccaga 
ggtacatgga 
tcattgagat 
gcttcatgtt 
gcacctactt 
cgggctccac 
aaatgttcgg 
ccttgggcat 
tctattccaa 
tatatcatcc 
aactttagct 
ggtgtgaaaa 
gatgtatacg 
cgtttcaaat 
acgttttgct 
ctactgttcc 
caaaatattt 
tgcatacggt 
atgcaactgc 
aagaaatgtg 
aaaaaaaaaa 



aagatacaac 
ttgaaaacgc 
agacgcgcaa 
cacggtgccg 
ctcccaggca 
gaacacggtg 
ttgcagtggc 
aggagtcgag 
ggtcatctgc 
gaacgagtac 
gcgcgacgag 
caatggccag 
gggatcccac 
gaatcgattc 
catcttcaac 
gcgaactctg 
gcgagatctt 
gcccaacaca 
attctgtgtg 
gtccacggct 
gctctatgcc 
agttccagtg 
cgcgtcgaga 
catcctgccc 
ctttccatgc 
gccactgatt 
gtacttcatc 
gctggttttc 
cctgctaaat 
gtcgatttac 
tctgttccaa 
tatctgcggc 
tgtgaaaata 
taatcgatta 
ccagttggct 
tgaatgccag 
caaactacat 
gagaaaattt 
agttttaaga 
ataaaataac 
gatatatttc 
gtacatcgag 
aacaaaatgt 
ctgattagca 
aa 



cctaaaatat 
aatcccagcg 
acgaggtcac 
ttccgtgcaa 
gatgagcaca 
ggcccgtacc 
cagaagtcct 
ctagagttca 
atggtcacct 
tggtaccaga 
cgcgatggca 
caaatcgtgg 
atcaacttct 
gacaagtacc 
agcttcatga 
cgcaaggatt 
ggtgatgaat 
ctgctcttct 
atcatgttcg 
atatttgtgt 
cgcctgggtg 
gctgtgtgcg 
gccattccct 
ttgactctgg 
cgcgtcaacg 
attgtgcttc 
ttcacctcct 
agcatcctga 
gccgaagatt 
gtgtacgcct 
acggccttct 
accgtcggct 
gactaagagc 
acaattgtat 
ctcctctttt 
tccaatatat 
atatcaccta 
gggaaaacat 
gtcaattaat 
atttctctac 
cgaagataac 
tattttattt 
gacttgaact 
atagtcgata 



ttctgctgtc 
agggcaaccg 
cacctgcaca 
tttgtttatg 
atcacaagta 
acaatcggca 
cgatatcgca 
gtggctacga 
tgcaggagga 
tgtacatcga 
agtactatat 
atatcaccct 
cctacgaggt 
tggatcccaa 
tggtcatctt 
atgctcggta 
acggctggaa 
cggcgttggt 
ccatagttgg 
atgccgccac 
gacgcatgtg 
gcacggcttt 
tcggtaccat 
tgggtactgt 
cggtgccacg 
ttggcggtgt 
tctgggcgta 
ctgtggtcac 
accgatggca 
attccttcta 
actttggcta 
atgtgggcac 
cctgcgactg 
ttattgagta 
cgtctctttg 
tcgcaccaaa 
aaaggtttac 
tgtgaattaa 
cctatacact 
ttacttaaat 
ttttatgtca 
gaattgtttg 
tgtttgcgcc 
atctaataat 



ggcaaatctg 
aatcaggccg 
agacacaatg 
tctgctgatc 
caatgaccgg 
ggagacgtac 
ctaccacgag 
gatggagttc 
gagcgccaag 
tggactgccc 
cttcacgcac 
gaccaccgag 
caactggaag 
cttcttccag 
cctggtgggt 
cagtaaggac 
gcaggtgcat 
gggcgctgga 
tgaattgtac 
ctcaccaatc 
gatccgacag 
cctgatcaac 
ggtggcggtc 
cgtgggccgc 
acccattccc 
cttgcccttt 
caagatctac 
cgtgtgcgtc 
gtggacgagt 
ttacttcttc 
catggcactc 
gaatctcttc 
caaacccacc 
gagttttagt 
ggttacccac 
gataaaagat 
ttcggaatta 
aagctctttc 
aatttttggt 
tactaagaag 
tatatgtgcg 
cctagaaaac 
aattaagtta 
aaattcaata 
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dbEST Id: 

EST name: 
GenBank Acc : 
GenBank gi : 

CLONE INFO 

Clone Id: 
DNA type: 

PRIMERS 

PolyA Tail : 

SEQUENCE 



Entry Created: 
Last Updated: 

COMMENTS 



LIBRARY 

Lib Name : 
Organism: 
Sex: 
Organ : 

Tissue type: 
Develop, stage: 
Vector: 
Description: 



SUBMITTER 

Name : 
Lab: 

Institution: 
Address : 
Tel : 
Fax : 
E-mail : 

CITATIONS 

Medline UID: 



38740 

HSB72F051 

Z28655 

434270 



72F05 
CDNA 



Unknown 



CAGATGTCATATTCAGTAAAATGGAAAAAGTCAGATGTGAAATTTGANGATCGATTTGAC 
AAATATCTTGATCCGTCCTTTTTTCAACATCGGATTCATTGGTTTTCAATTTTCAACTCC 
TTCATGATGGTGATCTTCTTGGTGGGCTTAGTTTCAATGATTTTAATGAGAACATTAAGA 
AAAGATTATGCTCGGTACAGTAAAGAGGAAGAAATGGATGATATGGATAGAGACCTAGGA 
GATGAATATGGATGGAAACAGGTGCATGGAGATGTATTTAGACCATCAAGTCACCCA 

Dec 14 1993 
Dec 14 1993 



strand (-), single read 



STRATAGENE Human skeletal muscle cDNA library, cat. #936215. 

Homo sapiens 

Female 

leg muscle 

skeletal muscle 

19 years 

Lambda ZAP I I 

Tissue from female, 19 years old, normal leg muscle. Cloning 
vector is Lambda ZAPII, in vivo excision from lambda ZAPII 
to pBluescript SK(+). Genexpress library reference is B. 



Genethon 

Genexpress -Genethon 

Genethon Centre de recherche sur le Genome Humain 

l,rue de 1 1 Internationale , BP60 91002 EVRY Cedex, FRANCE 

33169472800 

33160778698 

genexpress@genethon . f r 



95277534 
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IMAGE: molecular integration of the analysis of the human 
genome and its expression 

Auffray,C, Behar,G., Bois,F., Bouchier,C, da Silva,C, 
Devignes , M . D . , Duprat,S., Houlgatte , R . , Jumeau, M.N. , Lamy,B. 
, Lorenzo, F., Mitchell, H., Mariage- Samson, R . , Pietu,G., 
Pouliot,Y., Sebastian! -Kabaktchis , C . , Tessier,A. 
C. R. Acad. Sci. Ill, Sci . Vie 318 (2): 263-272 1995 
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